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NOVEL BACTERIAL GENES AND PROTEINS THAT ARE ESSENTIAL FOR 
CELL VIABILITY AND THEIR USES 



Throughout this application various publications are referenced. The disclosures of these 
publications in their entireties are hereby incorporated by reference into this application in 
order to more fully describe the state of the art to which this invention pertains. 

FIELD OF THE INVENTION 

The present invention relates generally to nucleotide sequences, and polypeptides 
encoded by the sequences, that are essential for bacterial viability, and to methods of 
using the nucleotide and polypeptide sequences. 

BACKGROUND OF THE INVENTION 

Bacterial genera, such as Streptococcus, Staphylococcus, Pseudomonas, Yersinia, 
Salmonella, and Enterobacter, are the cause of numerous afflictions in humans and 
animals. Bacterial infection can lead to serious health conditions, including pneumonia, 
osteomyelitis, meningitis, sinusitis, otitis, cystitis, and even food poisoning. Typically, 
these infections can be treated with standard antimicrobial agents such as antibiotics. 
However, the emergence of pathogenic bacterial strains that are resistant to antibiotics 
has risen alarmingly in the past two decades. This situation has created an urgent need 
for the development of new antimicrobial agents. 

One strategy for developing new antimicrobial agents is to identify bacterial gene 
sequences that encode gene products that are essential for bacterial cell viability and 



1 



WO 01/49721 



PCT/USOO/35604 



develop and/or identify agents which inhibit the function of the gene product. DNA 
sequencing technology has advanced from sequencing one gene at a time to sequencing 
entire genomes, the sum of all genes in an organism. With the recent arrival of bacterial 
genomic information, it is now possible to compare multiple bacterial genomes in an 
5 attempt to identify genes that encode conserved gene products. In this manner, one 
skilled in the art may identify a set of conserved bacterial genes, including a subset of 
genes that are essential for bacterial cell viability. The essential gene is then used as a 
starting point to develop therapeutic agents that inhibit or inactivate the product of the 
essential gene. 

10 

The availability of DNA sequence information for multiple microbial genomes is a recent 
development. The public release of the first complete genome, Haemophilus influenzae 
(Fleischmann, R.D., et al. 1995 Science 269:496-512 ), was followed in rapid succession 
by a number of public and private genome sequencing programs. Presently, some 20 
15 completely sequenced bacterial genomes have been published, and over 100 other 
sequencing projects are underway (Blattner, F.R., et al., 1997 Science 277:1453-74; 
Ferretti, J.J., et al., 1997 Adv Exp Med Biol 418:961-963; Koonin, E.V., et al., 1996 . 
Methods Enzymol 266:295-322). Analyses of these data indicate that approximately 
46% of putative bacterial genes are of unknown function having no attributable function. 

20 

Others have pursued various strategies to identify bacterial genes that are essential for 
viability. These strategies include: identifying genes that are expressed by the bacteria 
when present in the infected host (Hensel," M., et al., 1995 Science 269:400-3), . 
identifying essential genes by isolating temperature sensitive mutants (Schmid, M.B., et 
25 al., 1998 Curr Opin Chem Biol 2:529-34), and identifying genes in pathways known 
from prior physiological studies to be essential (Skarzynski, T. et al., 1996 Structure 
1996 4:1465-74) 

There continues to be a need to identify bacterial genes that encode gene products that are 
30 essential for cell viability, such as cell replication, growth, and survival. These genes and 
their encoded gene products can be used as a starting point towards identifying agents 
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that inhibit functions essential for cell viability, thereby causing bacterial cell stasis or 
death (e.g., antibacterial agents). 

The present invention provides experimental identification of novel, conserved essential 
5 genes (ceg) from bacteria and their encoded protein products. The ceg genes are 
considered essential to cell viability because disruption of an endogenous ceg gene results 
in lethality of a bacterial cell (e.g., as determined by failure to recover viable 
chloramphenicol-resistant colonies, as described herein). Thus, the gene products 
encoded by these genes are potentially valuable targets for chemotherapeutic intervention 
10 of bacterial infections. 

The ceg nucleotide sequences of the invention were obtained by large-scale 
computational comparisons of multiple genome sequences to identify conserved protein 
coding regions, followed by gene disruption to identify cegs. The conservation of protein 
15 sequences in many cases is believed to reflect the higher level conservation of common 
biochemical pathways essential for bacterial function and viability. 

SUMMARY OF THE INVENTION 

20 The acronyms "CEG" and "ceg" stand for Conserved Essential Gene. For convenience, 
the italicized term ceg refers herein to ceg nucleotide sequences. The capitalized term CEG 
refers herein to CEG polypeptide sequences. 

Embodiments of the ceg nucleotide sequences and the CEG polypeptide sequences are 
. 25 designated CFEs which stands for CEG For Expression. The CFEs are polypeptides 
resulting from expression of the ceg nucleotide sequence. 

The. present invention provides isolated nucleotide sequences of conserved essential 
genes from bacteria, designated ceg. The invention also provides recombinant nucleic 
30 acid molecules including the ceg sequences of the invention, and methods of uses thereof. 
Examples of nucleic acid molecules having ceg sequences are described in SEQ ID 
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NOS.: 1-113. The invention further provides isolated polypeptides and recombinant 
polypeptides having the .CEG sequences of the invention, and methods of uses thereof. 
Examples of polypeptides having CEG sequences are described in SEQ ID NOS. : 114- 
226. 

5 

The ceg sequences of the present invention are DNA or RNA. Further, the invention 
includes nucleic acid molecules that are identical or nearly identical (e.g., similar) with 
the ceg sequences of the invention. The invention additionally provides polynucleotide 
sequences that hybridize under stringent conditions to the ceg sequences of the invention. 
10 A further embodiment provides polynucleotide sequences which are complementary to 
the ceg sequences of the invention. Yet another embodiment provides ceg nucleic acid 
molecules that are labeled with a detectable marker. Another embodiment provides 
recombinant nucleic acid molecules, such as a vector or a fusion molecule, including the 
ceg sequences of the invention. 
15 * 

The present invention provides various ceg sequences, fragments thereof having essential 
gene activity, and related molecules such as antisense molecules, oligonucleotides, 
peptide nucleic acids (PNA), fragments, and portions thereof. 

The present invention relates to the inclusion of the polynucleotides encoding CEG gene 
products, such as CEG polypeptides, in an expression vector which can be used to 
transform host cells or organisms. Such transgenic hosts are useful for the production of 
CEG gene products for the development of antibacterial agents such as antibiotics. 

25 The invention further provides substantially purified CEG gene products, and uses 
thereof. 

The invention also relates to pharmaceutical compositions comprising antisense 
molecules capable of disrupting expression of ceg sequences, agonists, antagonists or 
30 inhibitors of CEG gene products, and antibodies reactive against the CEG polypeptides. 
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These compositions are useful for preventing the growth or survival of bacteria, for 
example, in the treatment of conditions associated with bacterial infections. 

BRIEF DESCRIPTION OF THE FIGURES 

5 

Figure 1: A schematic representation of the gene disruption assay, as described in Example 
3, infra. A) A recombinant vector undergoing homologous recombination with the host 
genome. B) The result of homologous recombination. 

10 Figure 2: A schematic representation of the polarity test for operons, as described in 
Examples 2 and 3, infra. A) The recombinant vector undergoing homologous 
recombination with the host genome. B) Case 1: one possible result of homologous 
recombination; the downstream Gene B has an independent promoter. C) Case 2; another 
possible result of homologous recombination; the downstream Gene B does not have an 

1 5 independent promoter. 

Figure 3: Purification of 2CFE 75, as described in Example 6, infra. A) Fractionation 
profile of 2CFE 75 eluted from a Ni-NTA column. B) Gel electrophoresis of pooled 
fractions of CFE 75. C) Non-denaturing gel electrophoresis to determine oligo form of 
20 2CFE 75. 

Figure 4: Fractionation profile of 2CFE 3 eluted from a hydroxyapatite column, as described 
in Example 7, infra. 

25 Figure 5: The biosynthesis pathway of Coenzyme A which starts with phosphorylation of 
pantothenate. 

Figure 6: Circular dichroism spectra of 2CFE 101 and 103, as described in Example 10, 
infra. A) Circular dichroism spectra of 2CFE 101 and 103 at 25 degrees C. B) Circular 
30 dichroism thermal melt spectra of 2CFE 101 and 103 at a range of zero to 100 degrees C. 
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Figure 7: Circular dichroism spectra of aggregate and monomer pools of 2CFE 101 and 103, 
as described in Example 10, infra. A) Circular dichroism spectra of aggregate and monomer 
pools of 2CFE 101 and 103 at 25 degrees C. B) Circular dichroism thermal melt spectra of 
aggregate and monomer pools of 2CFE 101 and 103 at a range of zero to 100 degrees C. 

5 

Figure 8: Absorbance spectra of pantothenate-dependent production of ADP, as described in 
Example 10, infra.* 

Figure 9: The results of size exclusion chromatography and gel electrophoresis showing the 
10 oligomeric forms of 2CFE 21 and 39, as described in Example 11, infra. Lanes 1-6 contain 
2CFE 21, lane 7 is a molecular weight marker, lanes 8-10 contain 2CFE 39. 

Figure 10: Gel electrophoresis of a helicase reaction using 2CFE 21 and 39 and radiolabeled 
synthetic Hoiliday Junction template, as described in Example 11, infra. Lane 1 contains 

15 the synthetic Hoiliday Junction template; lane 2 contains the synthetic duplex; lane 3 
contains a single-stranded template; lane 4 contains the helicase reaction using 2CFE 39; 
lane 5 contains the helicase reaction using 2CFE 21; lanes 6-8 contain the helicase reaction 
using 2CFE 39 and 21 at varying concentrations (e.g., 1, 2, and 3 jjM each); and lane 9 
contains the helicase reaction using 2 \xM each 2CFE 39 and 21 in the presence of ethidium 

20 bromide. 

Figure 1 1: A graph depicting the results of the helicase reaction which were monitored by . 
measuring the unquenching of the Hoiliday Junction templates with time, as described in 
Example 1.1, infra. 

25 

Figure 12: Capillary electrophoresis results, of 2CFE 8 with and without ssDNA, as 
described in Example 12, infra. A) Electropherogram of 2CFE 8 alone: B) 
Electropherogram of 2CFE 8 in the presence of a 32-nucleotide single-stranded oligomer. 

.30 Figure 13: Gel mobility shift assay of 2CFE 8, and 2CFE 8 in the presence of a single- 
stranded 32-mer, as described in Example 12, infra. . A) An ethidium bromide-stained, 
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native, polyaciylamide gel containing 2CFE 8, and 2CFE 8 in the presence of a 32-mer. B) 
The same native, polyacrylamide gel stained with Coomassie. 

Figure 14: The N-acetyl glucosamine pathway putatively mediated by 2CFE 3 and 2CFE 
5 86, as described in Example 13, infra. 

Figure 15: Capillary electrophoresis results of 2CFE 3 with and without putative substrates, 
as described in Example 13, infra.. A) Electropherogram of 2CFE 3 with .and without 
glucosamine- 1 -phosphate. B) Electropherogram of 2CFE 3 with and without D-glucose-1- 
10 phosphate. C) Electropherogram of 2CFE 3 alone, 2CFE 3 and glucose- 1 -phosphate, and 
2CFE 3 and glucose-6-phosphate. D) Electropherogram of 2CFE 3 alone or in the presence 
of glucosamine- 1 -phosphate, glucosamine-6-phosphate, D-glucose, D(+) galactose, and ct- 
D-glucose-1 -phosphate. 

15 Figure 16: Capillary electrophoresis results of FITC-derivitized 2CFE 3 polypeptide with 
and without D-glucosamine-6-phosphate (substrate) to produce the product D-glucosamine- 
1 -phosphate, using laser-induced fluorescence, as described in Example 13, infra. 
Electropherogram of D-glucosamine-6-phosphate (putative substrate), 2CFE 3 reacted with 
D-glucosamine-6-phosphate, and the product glucosamine- 1-phhosphate. 

20 

Figure 17: Gel electrophoresis of 2CFE 86 eluted from an Ni-NTA column, as described in 
Example 13, infra. 

Figure 18: HPLC analysis of a coupled reaction including 2CFE 3, 2CFE 86, and D- 
25 glucosamine-6-phosphate to produce the product, UDP-N-acetylglucosamine-1 -phosphate 
(UDPAG), as described in Example 13, infra. 

Figure 1 9: A fatty acid biosynthesis pathway. 
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Figure 20: Size exclusion chromatography to determine the molecular weight and 
oligomeric form of 2CFE 34, as described in Example 14, infra.. Selected eluted samples 
were sized by gel electrophoresis. 

Figure 21: Gelelectrophoresis of 2CFE 41 eluted from a Ni-NTA column, as described in 
Example 15, infra. 

Figure 22: Capillary electrophoresis results of 2CFE 40, 41 , and 46, as described in Example 
15, infra. 

Figure 23: Depicts a schematic diagram of a ligand which binds 2CFE 34. The Iigand is 2- 
pheriyl-N-(3 corboxyl-4hydroxyphenyl) azabicyclo [4.3.0] riona-2, 8-diene. 

Figure 24: Depicts a schematic diagram of a ligand which binds 2CFE 43. The ligand is N- 
(3, 5-dinitrobenzyl)-7-trifluoromethyl benza diaza furanolactone. 

Figure 25: Depicts a schematic diagram of a ligand which binds 2CFE 43. The ligand is 2- 
amino (N-para-methylphenyl sulfonamide)-3-phenylpropianic acid. 

Figure 26: A nucleic acid sequence of 2CFE1 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 27: A nucleic acid sequence of 2CFE2 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 28: A nucleic acid sequence of 2CFE3 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 29: A nucleic acid sequence of 2CFE4. deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 
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Figure 30: A nucleic acid sequence of 2CFE5 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

5 Figure 31: A nucleic acid sequence of 2CFE6 deposited with the American Type Culture 
Collection as ATCC designation ' on December 20, 2000. 

Figure 32: A nucleic acid sequence of 2CFE7 deposited with the American Type Culture 
Collection as ATCC designation . on December 20, 2000. 

10 

Figure 33: A nucleic acid sequence of 2CFE8 deposited with the American Type Culture 
Collection as ATCC designation , on December 20, 2000. 

Figure 34: A nucleic acid sequence of 2CFE9 deposited with the American Type Culture 
1 5 Collection as ATCC designation on December 20, 2000. 

Figure 35: A nucleic acid sequence of 2CFE10 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

20 Figure 36: A nucleic acid sequence of 2CFE1 1 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 37: A nucleic acid sequence of 2CFE12 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

25 

Figure 38: A nucleic acid sequence of 2CFE13 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 39: A nucleic acid sequence of 2CFE14 deposited with the American Type Culture 
30 Collection as ATCC designation on December 20, 2000. 
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Figure 40: A nucleic acid sequence of 2CFE1 5 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 41 : A nucleic acid sequence of 2CFE16 deposited with the American Type Culture 
5 Collection as ATCC designation on December 20, 2000. 

Figure 42: A nucleic acid sequence of 2CFE17 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

10 Figure 43: A nucleic acid sequence of 2CFE19 deposited with the American Type Culture 
Collection as ATCC designation ' on December 20, 2000. 

Figure 44: A nucleic acid sequence of 2CFE21 deposited with the American Type Culture 
Collection as ATCC designation • • on December 20, 2000. 

15 

Figure 45: A nucleic acid sequence of 2CFE24 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 46: A nucleic acid sequence of 2CFE25 deposited with the American Type Culture 
20 Collection as ATCC designation on December 20, 2000. 

Figure 47: A nucleic acid sequence of 2CFE26 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

25 Figure 48: A nucleic acid sequence of 2CFE27 deposited with the American Type Culture 
Collection as ATCC designation ' on December 20, 2000. 

Figure 49: A nucleic acid sequence of 2CFE28 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

30 
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Figure 50: A nucleic acid sequence of 2CFE29 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 51 : A nucleic acid sequence of 2CFE30 deposited with the American Type Culture 
5 Collection as ATCC designation ■ " ' on December 20, 2000. 

Figure 52: A nucleic acid sequence of 2CFE31 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

10 Figure 53: A nucleic acid sequence of 2CFE32 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 54: A nucleic acid sequence of 2CFE33 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

15 

Figure 55: A nucleic acid sequence of 2CFE34 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 56: A nucleic acid sequence of 2CFE35 deposited with the American Type Culture 
20 Collection as ATCC designation on December 20, 2000. 

Figure 57: A nucleic acid sequence of 2CFE36 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

25 Figure 58: A nucleic acid sequence of 2CFE37 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 59: A nucleic acid sequence of 2CFE38 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

30 
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Figure 60: A nucleic acid sequence of 2CFE39 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 61 : A nucleic acid sequence of 2CFE40 deposited with the American Type Culture 
5 Collection as ATCC designation on December 20, 2000. 

Figure 62: A nucleic acid sequence of 2CFE41 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

10 Figure 63: A nucleic acid sequence of 2CFE42 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 64: A nucleic acid sequence of 2CFE43 deposited with the American Type Culture 
Collection as ATCC designation , on December 20, 2000: 

15 

Figure 65: A nucleic acid sequence of 2CFE44 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 66: A nucleic acid sequence of 2CFE45 deposited with the American Type Culture 
20 Collection as ATCC designation on December 20, 2000. 

Figure 67: A nucleic acid sequence of 2CFE46 deposited with the American Type Culture 
Collection as ATCC designation - on December 20, 2000. 

25 Figure 68: A nucleic acid sequence of 2CFE47 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 69: A nucleic acid sequence of 2CFE48 deposited with the American Type Culture 
Collection as ATCC designation . on December 20, 2000. 

30 
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Figure 70; A nucleic acid sequence of 2CFE49 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 71 : A nucleic acid sequence of 2CFE50 deposited with the American Type Culture 
5 Collection as ATCC designation on December 20, 2000. 

Figure 72: A nucleic acid sequence of 2CFE51 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

10 Figure 73: A nucleic acid sequence of 2CFE52 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 74: A nucleic acid sequence of 2CFE53 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

15 

Figure 75: A nucleic acid sequence of 2CFE54 deposited with the American Type Culture 
Collection as ATCC designation ._ on December 20, 2000. 

Figure 76: A nucleic acid sequence of 2CFE55 deposited with the American Type Culture 
20 Collection as ATCC designation ._ on December 20, 2000. 

Figure 77: A nucleic acid sequence of 2CFE56 deposited with the American Type Culture * 
Collection as ATCC designation : on December 20, 2000. 

25 Figure 78: A nucleic acid sequence of 2CFE57 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 79: A nucleic acid sequence of 2CFE58 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 
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Figure 80: A nucleic acid sequence of 2CFE59 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 81: A nucleic acid sequence of 2CFE60 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 82: A nucleic acid sequence of 2CFE61 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 83: A nucleic acid sequence of 2CFE62 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 84: A nucleic acid sequence of 2CFE64 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 85: A nucleic acid sequence of 2CFE65 deposited with the American Type Culture 
Collection as ATCC designation - on December 20,,2000. 

Figure 86: A nucleic acid sequence of 2CFE66 deposited with the American Type Culture 
Collection as ATCC designation ' on December 20, 2000. 

Figure 87: A nucleic acid sequence of 2CFE67 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 88: A nucleic acid sequence of 2CFE68 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 89: A nucleic acid sequence of 2CFE69 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 
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Figure 90: A nucleic acid sequence of 2CFE70 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 91 : A nucleic acid sequence of 2CFE71 deposited with the American Type Culture 
5 Collection as ATCC designation on December 20, 2000. 

Figure 92: A nucleic acid sequence of 2CFE72 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

1 0 Figure 93 : A nucleic acid sequence of 2CFE75 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 94: A nucleic acid sequence of 2CFE76 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

15 

Figure 95; A nucleic acid sequence of 2CFE78 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 96: A nucleic acid sequence of 2CFE79 deposited with the American Type Culture 
- 20 Collection as ATCC designation on December 20, 2000. 

Figure 97: A nucleic acid sequence of 2CFE80 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

25 Figure 98: A nucleic acid sequence of 2CFE81 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 99: A nucleic acid sequence of 2CFE82 
Collection as ATCC designation 

30 
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Figure 100: A nucleic acid sequence of 2CFE83 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 101 : A nucleic acid sequence of 2CFE84 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 102: A nucleic acid sequence of 2CFE85 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 103: A nucleic acid sequence of 2CFE86 deposited with the American Type Culture 
Collection as ATCC designation , on December 20, 2000. 

Figure 104: A nucleic acid sequence of 2CFE87 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 105: A nucleic acid sequence of 2CFE88 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 106: A nucleic acid sequence of 2CFE89 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 107: A nucleic acid sequence of 2CFE90 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 108: A nucleic acid sequence of 2CFE91 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 109: A nucleic acid sequence of 2CFE92 deposited with the American Type Culture 
Collection as ATCC designation _ on December 20, 2000. 
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Figure 1 10: A nucleic acid sequence of 2CFE94 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 111: A nucleic acid sequence of 2CFE95 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 1 12: A nucleic acid sequence of 2CFE96 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 113: A nucleic acid sequence of 2CFE97 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 1 14: A nucleic acid sequence of 2CFE99 deposited with the American Type Culture 
Collection as ATCC designation . on December 20, 2000. 

. Figure 115: A nucleic acid sequence of 2CFE101 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 116: A nucleic acid sequence of 2CFE102 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 117: A nucleic acid sequence of 2CFE103 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 118: A nucleic acid sequence of 2CFE104 deposited with the American Type 
Culture. Collection as ATCC designation on December 20, 2000. 

Figure 119: A nucleic acid sequence of 2CFE105 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 
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Figure 120: A nucleic acid sequence of 2CFE106 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 121: A nucleic acid sequence of 2CFE107 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 122: A nucleic acid sequence of 2CFE108 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 123: A nucleic acid sequence of 2CFE109 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 124: A nucleic acid sequence of 2CFE111 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 125: A nucleic acid sequence of 2CFE112 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 126: A nucleic acid sequence of 2CFE113 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 127: A nucleic acid sequence of 2CFE114 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 128: A nucleic acid sequence of 2CFE115 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 129: A nucleic acid sequence of 2CFE116 deposited with the American Type 
Culture Collection as ATCC designation ; on December 20, 2000. 
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Figure 130: A nucleic acid sequence of 2CFE117 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 131: Schematic structures of alkyloids which are ligands, for example, of 2CFE42. 

DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

All scientific and technical terms used in this application have meanings commonly used in 
the art unless otherwise specified. As used in this application, the following words or 
phrases have the meanings specified. 

As used herein, a ceg nucleic acid molecule is said to be "isolated" when the nucleic acid 
molecule is substantially separated from contaminant nucleic acid molecules that encode 
polypeptides other than CEGs. Additionally, isolated nucleic acid molecule refers to any 
RNA or DNA sequence obtained from a natural source, or constructed by recombinant 
methods, or synthesized. A skilled artisan can readily employ nucleic acid isolation 
procedures to obtain an isolated nucleic acid molecule having ceg sequences. 

The term "ceg" includes all isolated forms of ceg nucleotide and CEG amino acid sequences 
disclosed herein. The ceg sequences encode gene products that have essential biological 
functions in bacterial cells, such as, for example, nucleotide biosynthesis, amino acid 
biosynthesis, DNA replication, RNA transcription, protein translation, DNA 
recombination, DNA repair, biosynthesis of cofactors (e.g., Coenzyme A), biosynthesis 
of prosthetic groups, cellular processes (e.g., chaperones, cell division, and polypeptide 
secretion), energy metabolism (e.g., pentose phosphate pathway, glycolysis, 
gluconeogenesis), fatty acid biosynthesis, cell wall biosynthesis, and/or biosynthesis of 
purines, pyrimidines, nucleosides, and nucleotides. Accordingly, the gene products of the 
ceg nucleotide sequences are required for viability of bacterial cells. The term "ceg" also 
includes variants having nucleotide sequence similarity to the disclosed ceg sequences, 
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including sequences isolated from various bacterial genera and species, allelic variants, 
mutant variants, and ceg variants that encode conservative and non-conservative amino acid 
substitutions. The present invention also provides for all ceg sequences generated by 
recombinant DNA technology, including complementary sequences, ceg sequences that 
hybridize to the sequences of the invention at high stringency hybridization conditions, 
fusion genes comprising a ceg sequence, and codon usage variants. 

The term "essential genes" refers to a nucleotide sequence that encodes a gene product 
having a function which is required for cell viability. The term "essential protein" refers 
to a polypeptide that is encoded by an essential gene and has a function that is required 
for cell viability. Accordingly, a mutation that disrupts the function of the essential gene 
or essential proteins results in a loss of viability of cells harboring the mutation. 

"Non-essential genes" or "non-essential proteins" refer to genomic information of the 
protein(s) or RNAs encoded therefrom which, when disrupted by a mutation, do not 
result in a loss of viability of cells harboring said mutation under defined laboratory 
conditions. 

As used herein, a nucleotide sequence is said to be "identical" to another reference 
sequence when both nucleotide sequences are exactly alike. 

As used herein, a nucleotide sequence is said to be "similar" to another reference 
sequence when a comparison of the two sequences shows that they have a low level of 
sequence differences. For example, two sequences are considered to be similar to each 
other when the percentage of nucleotides that are shared between the two sequences is 
between about 70 % to 99.99% over the entire length of the two sequences. 

As used herein an amino acid sequence is said to be "similar" to another reference 
sequence when a comparison of the two sequences shows that they have a low level of 
sequence differences. For example, two sequences are considered to be similar to each 
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other when the percentage of amino acids that are shared between the two sequences may 
be between about 30% to 100% identity over the entire length of the two sequences. 

As used herein, an "allele" or "allelic sequence" is an alternative form of the naturally- 
occurring ceg sequence. Alleles result from a mutation, that changes the nucleotide 
sequence, and generally produce altered mRNAs or polypeptides whose structure or 
function may or may not be altered. 

"Substantially purified" as used herein means a specific isolated nucleic acid or protein, 
or fragment thereof, in which substantially all contaminants (i.e. substances that differ 
from said specific molecule) have been separated from said nucleic acid or protein. 

In a host cell, an "endogenous" sequence as used herein means a nucleic acid sequence 
that is naturally-occurring and resides within the host genome. 

In a host cell, an "exogenous" sequence as used herein means an isolated nucleic acid 
sequence that is introduced into the host cell, using any one of a variety of introduction 
methods, such as transfection, electroporation, cationic lipid or salt treatment methods. 

"Knockout mutant" or "knockout mutation" as used herein refers to an in vitro engineered 
disruption of a region of endogenous chromosomal DNA (e.g., disruption of the genome), 
typically within a protein coding region. A knockout mutation can be generated by 
inserting an exogenous DNA sequence into the homologous endogenous sequence: A 
knockout mutation occurring in a protein coding region is expected to disrupt normal 
expression of the protein coding region. This usually leads to loss of the function 
provided by the protein. 

In order that the invention herein described may be more fully understood, the following 
description is set forth. 
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A) MOLECULES OF THE INVENTION 

1.) CEG NUCLEIC ACID MOLECULES 

5 The present invention provides isolated and recombinant ceg nucleic acid molecules and 
fragments thereof, and related molecules, such as sequences complementary to ceg 
sequences or a portion thereof, and those that hybridize to the nucleic acid molecules of 
the invention. 

10 The ceg polynucleotide sequences, also referred to herein as nucleic acid molecules of the 
invention, are preferably in isolated form, including DNA, RNA, DNA/RNA hybrids, and 
related molecules, and fragments thereof. Specifically contemplated are genomic DNA, 
ribozymes, and antisense molecules, as well as nucleic acid molecules based on an 
alternative backbone or including alternative bases, whether derived from natural sources or 

1 5 synthesized. Embodiments of particular ceg polynucleotide and amino acid sequences 
include, but are not limited to, the sequences described in Tables I and II (e.g., SEQ ID 
NOS:l-113, 114-226 and SEQ ID NOS: 227-339, 340-452, respectively). The ceg 
polynucleotide and amino acid sequences were designated cfe which stands for CEG For 
Expression. 

20 

Biological samples of the 2CFE nucleic acid molecules (e.g., SEQ ID NOS: 227-331) 
were deposited on December 20, 2000 with the American Type Culture Collection 
(ATCC), 10801 University Blvd., Manassas, VA 201 10-2209. 



TABLE I 



CFE Designation 


SEQ.ro NO. 


SEQ.ro NO. 


POLARITY 




(Nucleotide) 


(Polypeptide) 




CFE 1 


1 


114 


+ 


CFE 2 


2 


115 




CFE 3 


3 


116 




CFE 4 


4 


117 " 


• + 


CFE 5 


5 


118 




CFE 6 


6 


119 


+ 



22 



WO 01/49721 



PCT/US00/35604 



TABLE I 



CFE Designation 


SEQ. ID NO. SEQ. ID NO. POLARITY 
(Nucleotide) (Polypeptide) 


CFE7 


7 


120 




CFE 8 


8 


121 


■+ 


CFE 9 


9 


122 




CFE 10 


10 


123 


+ 


CFE 11 


11 


124 • 


-f 


CFE 12 


12 


125 




CFE 13 


13 


126 




CFE 14 


14 


127 


+ 


CFE 15 


15 


128 




CFE 16 


16 


199 




CFE 17 


17 


i j\j 




CFE 19 


18 


131 


4- 


CFE 21 


19 


139 




CFE 24 


90 






CFE 25 


* 91 


1 3d 


_L 


CFE 26 


99 


1 3^ 




CFE 27 


91 


136 


_|_ 


CFE 28 


94 


137 




CFE 29 


9S 


13R 




CFE 30 




139 




CFE 31 


97 


140 


4- 


CFE 32 


98 


141 

it! 


4- 


CFE 33 


99 


149 




CFE 34 




143 


4- 


CFE 35 


J 1 


144 


4- 


CFE 36 


32 


14S 


4- 


CFE 37 


33 


146 




CFE 38 


34 


147 


+ 


CFE 39 


35 


148 




CFE 40 


36 


149 




CFE 41 


37 


150 




CFE 42 


38 


151 




CFE 43 . 


39 


152 




CFE 44 


40 


153 


+ 


CFE 45 


41 


154 




CFE 46 


42 


155 





23 



WO 01/49721 



PCT/US00/35604 



TABLE I 



CFE Designation 


SEQ. ID NO. SEQ. ID NO. POLARITY 
(Nucleotide) (Polypeptide) 


CFE 47 


43 






CFE 48 


44 


157 




CFE 49 


4S 


1 

1 JO 


J_ 
i 


CFE 50 


46 


159 


i 


CFE 51 


47 


1 uu 


j_ 
i 


CFE 52 


48 

"TO 


161 

lul 




CFE 53 


4Q 


1 oz 


i 

X 


CFE 54 


Jv 


161 

IOj 


1 

X 


CFE 55 




1 64 


I 

X 


CFE 56 


JZ 


1 6^ 
IOj 


] 

X 


CFE 57 




1 66 
100 


1 

X 


CFE 58 


<;4 


- 1A7 
10/ 


I 

X 


CFE 59 




1 Aft 
105 




CFE 60 


^6 


1 AO 

ioy 


t 

+ 


CFE 61 


J I 


1 7fi 

1 /U 




CFE 62 


<ft 

JO 


1 71 
1/1 




CFF 61 


jy 


1 77 
1 /Z 




CFF 64 


ou 


1 71 
1 / J 




OFF 6S 


01 


1 7/1 • 

1 /4 


+ 


CFF 66 


oz 


1 7< 
1 / J 


+ 


CFE 67 


A - * 

OJ 


1 7£ 
1 /O 


i 

x 


CFE 68 


0*t 


1 77 
1 / / 




CFE 69 


OJ 


1 7ft 
1 /o 




CFE 70 


66 
OO 


1 70 

1 /y 


i 

X 


CFE 71 


67 

0/ 


1 ftn 
loU 




CFE 72 


Aft 
Oo 


1 ft V 
151 




CFF 71 


60 


1 ft7 
ioz 


1 

X 


CFF 74 


70 


I O J 




CFF 7^ 


71 


184 




CFE 76 


72 


.185 


' X 


CFE 77 


73 


.186 




CFE 78 


74 


187 


.+ ■ 


CFE 79 


75 


188 




CFE 80 


76 


189 




CFE 81 


77 


190 


X 


CFE 82 


78 


191 





24 



WO 01/49721 



PCT/US00/35604 



TABLE I 



CFE Designation 


SEQ. ID NO. SEQ. ID NO. POLARITY 
(Nucleotide) (Polypeptide) 


CFE83 


79 


192 




CFE 84 


80 


193 




CFE 85 


81 


194 




CFE 86 


82 


195 




CFE 87 


83 


196 




CFE 88 


84 


197 




CFE 89 


85 


198 




CFE 90 


86 


199 


+ 


CFE 91 


87 


200 




CFE 92 


88 


201 




CFE 93 


89 


202 


+ 


CFE 94 


90 


203 


+ 


CFE 95 


91 


204 




CFE 96 


92 


205 


+ 


CFE 97 


93 


206 




CFE 98 


94 


207 




CFE 99 


95 


208 


+ 


CFE 100. 


96 


209 




CFE 101 


97 


210 




CFE 102 


98 


211 




CFE 103 


99 


. 212 




CFE 104 


100 


213 


+ 


CFE 105 


101 


214 




CFE 106 


102 


215 




CFE 107 


103 


216 




CFE 108 


104 


217 


+ 


CFE 109 


105 


218 




CFE110 


106 


219 




CFE 111 


. 107 


220 




CFE 112 


108 


221 




CFE 113 


109 


222 




CFE 114 


110 . 


223 




CFE 115 


Ill 


224 




CFE 116 


112 


225 




CFE 117 


113 


226 





25 



WO 01/49721 



PCT/US00/35604 



TABLE H 







OXL\£. JLU INU. 


rIGURE 




l UvlvU UUC 1 


\jl uiypepiiuej 




2CFE 1 






zo 


2CFE2 






z/ 


2CFE 3 






zo 


2CFE4 






90 


2CFE 5 








2CFE 6 






31 








iz 


2CFE 8 






JO 


2CFE 9 






1A 


icw in 






35 


9PFF 1 1 






36 


9PFF 19 
zv^rnr iz 






37 


9PFF 1 1 






38 


9r*FP 1A 






39 


9PFF 1 <\ 






40 


9PFF 1£ 






A 1 

41 


9PFF 17 






42 


9PFF 10 






43 


9PFF 9 1 
Zv-rrn zi 






44 


9PFF 9/1 
Zk-Jrn z*f 






45 


ZK^rtj ZD 






46 


ZV^rJD ZD 






47 


9PFF 97 
zv^rii z / 






A O 

48 


9PFF 98 
zv_,r.c Zo 






/in 

49 


9PFF 90 
zv^rxZf zy 






f A 

jO 


9PFF 






51 


9PFF ^ 1 








9PFF ^9 








2CFE 33 








2CFE 34 






55 


2CFE 35 






56 


2CFE36 






57 


2CFE37 






58 


2CFE38 






59 


2CFE39 






60 



26 



WO 01/49721 



PCT/US00/35604 



CFF Designation 


SEO ID NO 


SEO ID NO 


FICIFRF 




(Nucleotide) 


(Polypeptide) 




2CFE 40 






61 


2CFE 41 






62 


2CFE 42 






* 63 


2CFE 43 






64 


2CFE 44 






65 


2CFE 45 






66 


2CFE 46 






67 


2CFE 47 






68 


2CFE 48 






69 


2CFE 49 






70 


2CFE 50 






71 


2CFE51 






72 


2CFE 52 






73 


2CFF 53 






74 


2CFE 54 






75 


7CFF 55 






76 


2CFF 56 






77 


9rFF 57 






78 








79 


9PFF SO 






80 








81 


OPEC <1 






82 


9PFF 6? 






83 


9PFF 64 






84 


2CFF 65 






85 


2CFF 66 






86 


2CFE 67 






87 


2CFE 68 






88 


9CFF 69 


■■- ■- ■-■ - - 




89 


9PFF 70 






90 


OPFT? 71 






91 


9PT7T? 77 






92 


2CFE 75 






93 


2CFE 76 . 






94 


2CFE 78 






95 


2CFE79 






96 


2CFE 80 






97 



27 



WO 01/49721 



PCT/US00/35604 



v^r Hi IScSlgUalluIl 












li UijUcu liuct 




OCW 81 






yo 


nCW 89 
Zv>riz» oz 






oo 

yy 


zbrC/ oj 






i oo 
1UU 


9PFF 84 






ioi 


9PFF 8S 

ZV^F.Cr 






1 09 
1UZ 








1 uJ 


2CFF 87 








2PFF 88 






ins 

1UJ 


9PFF 80 






10£ 


9PFF on 








9PFF 01 






108 
Iuo 


z^rii yz 






1 00 
Ivy 


9PFF Qd 






1 1 0 
1 IU 


9PFF OS 






1 1 1 
ill 


9PFF 06 






1 19 
1 1Z 


9r*T7C 07 

ZL^rJc y / 






1 1 7 
1 13 


9r*T7P oo 

z^rJb yy 






11/1 
1 14 


zCrb 1U1 






l Id 


zt^rJb lUz 






1 lo 


zCrii 1UJ 






117 
11/ 


7/~ , t?n i A/i ' 
zLrb 1U4 






110 
I 1 0 


9r*FR 1 
z^rn luj 






1 1 O 

i iy 


9PFF 10£ 






190 
XZU 


or*w 1 r»7 
zv^rii i\j / 






191 
IZl 


9 PITT? 10B 






199 
1ZZ 


9 OPT? i no 
Zv^rii iuy 






191 
IZj 


1 9f T E?l? ill 
zJLrli 111 






19A 
1Z4 


9/" v ET? 117 

ZLrii 1 Iz 






19^ 
IZj 


9PFF 1 1 ^ 






126 


2CFE114 






127 


2CFEU5 






128 


2CFE116 






129 


2CFE117 ' 






130 



28 



WO 01/49721 



PCT/US00/35604 



a) Variant ceg Nucleotide Sequences 

The present invention also provides nucleic acid molecules having a nucleotide sequence 
substantially identical or similar to the ceg sequences (SEQ ID NOS: 1-113, 227-331) 
disclosed herein. 

The present invention provides nucleotide sequences which are similar to SEQ ID 
NOS:l-113 and/or SEQ ID NOS:227-331. The present invention provides nucleotide 
sequences which vary from SEQ ID NOS:l-113 or 227-331 by a range of about 1% to 
about 70%. 

The present invention encompasses variations in polynucleotide sequences resulting from 
mutations and/or from transfer of genetic material from one cell to another (e.g., 
horizontal gene transfer or horizontal gene exchange). 

The present invention also provides for variants of the polynucleotide ceg sequences 
disclosed herein, including variants isolated from naturally-occurring sources, those 
generated by recombinant DNA technology or other in vitro synthesis methodologies 
(e.g., PCR). The variant polynucleotide sequences of the invention encode polypeptides 
that exhibit the biological activity of naturally-occurring CEG polypeptides, such as 
activity required for bacterial cell viability. 

In general, for example, a variant of ceg polynucleotide sequences may encode a 
polypeptide that differs by one or more amino acid substitutions. The variant may have 
conservative changes, wherein a substituted amino acid has similar structural or chemical 
properties, eg, replacement of leucine with isoleucine. 

A polynucleotide sequence can encode conservative amino acid substitutions without 
altering either the conformation or the 'function of the polypeptide. Such changes include 
substituting any of isoleucine (I), valine (V), and leucine (L) for any other of these 
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hydrophobic amino acids; aspartic acid (D) for glutamic acid (E) and vice versa; 
glutamine (Q) for asparagine (N) and vice versa; and serine (S) for threonine (T) and vice 
versa. Other substitutions can also be considered conservative, depending on the 
environment of the particular amino acid and its role in the three-dimensional structure of 
the protein. For example, glycine (G) and alanine (A) can frequently be interchangeable, 
as can alanine (A) and valine (V). Methionine (M), which is relatively hydrophobic, can 
frequently be interchanged with leucine and isoleucine, and sometimes with valine. 
Lysine (K) and arginine (R) are frequently interchangeable in locations in which the 
significant feature of the amino acid residue is its charge and the differing pK's of these 
two amino acid residues are not significant. Still other changes can be considered 
"conservative" in particular environments. 

A variant may also have nonconservative changes, eg, replacement of a glycine with a 
tryptophan. Other variations may also include amino acid deletions or insertions, or both. 
Guidance in determining which and how many amino acid residues may be substituted, 
inserted or deleted without abolishing biological or immunological activity may be found 
using computer programs well known in the art, for example, DNASTAR software. 

Another type of ceg sequence variant includes naturally-occurring allelic variants of ceg 
which share significant similarity (e.g., between about 30- 99%) to the disclosed CEG 
polypeptide sequence. Allelic variants of the ceg sequences can encode conservative or 
non-conservative amino acid substitutions of the CEG polypeptide sequence herein 
described. 

An example of allelic variants of ceg are mutant alleles of ceg polynucleotide sequences that 
encode a polypeptide having one or more changes in the polypeptide sequence, such as 
amino acid substitutions, deletions, insertions, frame shifts, or truncations. The mutant 
alleles of ceg may or may not encode a CEG polypeptide having the same biological 
functions as wild-type CEG proteins. 
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Variations in the bacterial genomic sequences can also arise from transfer of genetic 
material to another bacterial cell. The transfer of gene sequences can occur intraspecies 
or interspecies. Gene transfer can occur between bacterial cells which are members of 
the same or different populations. A population includes, but is not limited to, a serotype 
isolate, a clinical isolate, a naturally-occurring isolate, a strain, and a species. The 
transfer of genetic material can occur between cells within a population; for example 
transfer between serotype A to serotype A, or between S. pneumoniae and S. 
pneumoniae. The transfer of genetic material can occur between cells of different 
populations; for example, between serotype A to serotype B or S. pneumoniae and S. 
mutans. 

Gene transfer can give rise to mutant or polymorphic variant genes sequences. In rare 
cases, gene transfer introduces new gene sequences that confer a new phenotype, such as 
antibiotic resistance. The transfer of genetic material includes transfer of large regions of 
genomic sequences which include partial gene sequences, whole single gene sequences, 
or multiple gene sequences. This mode of transfer can give rise to replacement of native 
whole gene sequences or introduction of new sequences in the recipient cell. This mode 
of transfer gives rise to mosaic gene sequences in the recipient cell. 

The variation of genomic sequences resulting from gene transfer can be examined using 
molecular techniques, including: multilocus enzyme electrophoresis (Selander. R. K., et 
al., 1986 Appl Environ. Microbiol 51:837-884); and restriction endonuclease cleavage 
electrophoretic profiling (Coffey, T. J., et al., 1991 Mol Microbio. 5:2255-2260); pulse- 
field gel electrophoresis fingerprinting (Bygraves, J. A. and Maiden, M. C. J. 1992 J. 
Gen. Microbiol 138:523-531); and ribotyping (Stull, T. L., et al., 1988 1 Infect Dis. 
157:280-286). The degree of variation can vary greatly, and ranges from little or no 
variation as exemplified by gene sequences of E. coli (Caugant, d. A., et al., 1981 
Genetics 98:467-490; Whittam, T. S., et al., 1983 Mol Biol Evol 1:67-83; Souza, V., et 
al., 1992 Proc. Natl Acad ScL USA 89:8389-8393) and Salmonella (Selander, R. K., et 
al., 1990 Infect Immun 58:2262-2275; Selander, R.K. and Smith, N. H. 1990 Rev. Med. 
Microbiol 1:219-228; Smith, J. M., et al., 1993 Proc. Natl Acad Set USA 90:4384- 
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4388), to extensive gene transfer in Neisseria gonorrhoeae (Smith, J. M, et al, 1993 
Proc. Natl Acad Set USA 90:4384-4388). 

Gene transfer can be examined between various isolates of a particular microbial species 
which are antibiotic-sensitive or antibiotic-resistent (Coffey, T. J., et al., 1991 Molec. 
Microbiol 5:2255-2260). Molecular biology techniques can be utilized to study the 
degree of transfer between populations, such as, for example, the degree of gene transfer 
between serotypes, isolates, strains/or species . The degree of transfer can be examined 
by comparing, for example, the penicillin binding proteins and numerous different loci 
which encode metabolic enzymes or capsular biosynthesis enzymes. 

For example, intra-species, inter-serotype, gene transfer is possible (Coffey, T. J., et al., 
1991 supra). Additionally, intraspecies gene transfer in S. pneumoniae (Coffey, T. J., et 
al., 1998 MoL Microbiol 27:73-83), Vibrio cholerae (Bik, E. M., et al., 1995 EMBO J. 
14:209-216), and Haemophilus influenzae (Kroll, J. S. and Moxon, E. R. 1990 1 
Bacteriol 172: 1374-1379) are possible. 

Interspecies gene transfer is also possible (Dowson, C. G, et al., 1 989 Proc. Natl Acad 
Set USA 86:8842-8846; Laibl, G., et al, 1991 MoL Microbiol 5:1993-2002; Bourgoin, 
F, et al., 1999 Gene 233:151-161). 

Variant gene sequences arising from gene transfer can be continually generated in 
transformable bacteria (e.g., transformation competent), such as S. pneumoniae. For 
example, the worldwide spread of varying degrees of antibiotic resistance has. been 
documented and reviewed (Dowson, C. G., et al, 1994 Trends Microbiol 2:361-366; 
Spratt, B. G. in Bacterial Cell Wall, eds Ghuysen J-M and Hakenbeck, R. 1994 pp. 517- 
534; and reviewed in Maiden, M. C. J. 1998 Clinic. Infect. Dis. 27 (Supplement 1) S12- 
S20). For example, variant gene sequence arising from gene transfer can be tracked 
using a marker gene such as the gene which encodes the penicillin binding protein 
(Barcus, V. A, et al, 1995 FEMS Microbiol Lett 126:299-303). At the nucleotide level, 
gene sequences encoding the penicillin binding proteins in susceptible and resistant 
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strains differ by about 14% to 23% (Hakenbeck, R. 1995 Biochem. Pharmacol 50:1121- 
1 127; Spratt, B. G. in Bacterial Cell Wall, eds Ghuysen J-M. and Hakenbeck, R. 1994 pp. 
517-534; Spratt, B. G., et al., 1991 Neisseria meningitidis and Streptococcus pneumoniae 
eds. Camisi, J., et al., pp. 73-83; Coffey, T. J., et al., 1995 Micro, Drug Resist. 1 :29-34). 

The ceg nucleotide sequences can be isolated from various species of Streptococcus 
including Streptococcus pneumoniae. Additionally, the ceg sequences can be isolated from 
other Steptococcal species, including S. mutans, & pyogenes, and S. thermophila, The ceg 
polynucleotide sequences can also be isolated from strains of other bacterial genera 
including, but not limited to, Streptococcus, Escherichia, Bacillus t Pseudomonas, 
Yersinia, Salmonella, and Haemophilus. 

The present invention additionally provides isolated codon-usage variants that differ from 
the disclosed ceg nucleotide sequences, yet do not alter the predicted CEG polypeptide 
sequence or function. The codon-usage variants may be generated by recombinant DNA 
technology. Codons may be selected to optimize the level of production of the ceg 
transcript or CEG polypeptide in a particular prokaryotic or eukaryotic expression host, 
in accordance with the frequency of codon utilized by the host cell. Alternative reasons 
for altering the nucleotide sequence encoding a CEG polypeptide include the production 
of RNA transcripts having more desirable properties, such as an extended half-life or 
increased stability. A multitude of variant ceg nucleotide sequences that encode the 
respective CEG polypeptide may be isolated, as a result of the degeneracy of the genetic 
code. Accordingly, the present invention contemplates selecting every possible triplet 
codon to generate every possible combination of nucleotide sequences that encode the 
disclosed CEG polypeptides. This particular embodiment provides isolated nucleotide 
sequences that vary from the sequences as described in SEQ ID NOs.: 1-113 or 227-331, 
such that each variant nucleotide sequence encodes a polypeptide having sequence 
identity with the amino acid sequences, as described in SEQ ID NOs.: 114-226 or 332- 
436, respectively. 



33 



WO 01/49721 



PCT/US00/35604 



b) Complementary Sequences 

The present invention includes polynucleotide sequences that are complementary to the 
sequences disclosed herein. The term "complementary" as used herein refers to the 
capacity of purine and/or pyrimidine nucleotides to associate through hydrogen bonding 
to form double stranded nucleic acid molecules. The following base pairs are related by 
complementarity: guanine and cytosine; adenine and thymine; and adenine and uracil. 
Complementary applies to all base pairs comprising at least two single-stranded nucleic 
acid molecules. 

c) Sequences Capable of Hybridizing 

Another embodiment provides nucleic acid molecules that will hybridize to ceg 
sequences under hybridization conditions. It is readily apparent to one skilled in the art 
that the stringency of the hybridization condition selected will depend upon the 
characteristics of the nucleic acid molecule to be hybridized, such as, the length, the 
degree of complementarity (e.g., exact or non-exact complementarity), the percent A/T 
content, and the objective of the hybridization experiment. 

The hybridization procedure may by performed in low stringency hybridization 
conditions. Low stringency hybridization conditions will permit hybridization between 
two nucleic acid molecules that differ from exact complementarity by about 25% to 70%. 
Hybridization under standard high stringency conditions will occur between two 
complementary nucleic acid molecules (e.g., 100% exact complementarity) or two 
complementary nucleic acid molecules that differ from exact complementarity by about 
1% to about 70%. 

The high stringency hybridization conditions that disfavor non-homologous base pairing 
are well known in the art. Typically, high stringency hybridization conditions, includes 
but is not limited to, hybridizing at 50 °C to 65 °C in 5X SSPE, and washing at 50 °C to 
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65 °C in G,5X SSPE. Typically, low stringency conditions, includes but is not limited to, 
hybridizing at 35 °C to 37 °C in 5X SSPE and 40% to 45% formamide and washing at 42 
°C in 1-2X SSPE. The conditions and formulas for high stringency hybridization 
methods are well known in the art and can be readily obtained in Molecular Cloning; A 
Laboratory Manual (2 nd edition, Sambrook, Fritch, and Maniatis 1989, Cold Spring 
Harbor Press) or in Short Protocols in Molecular Biology (Ausubel, F. M., et al., 1989, 
John Wiley & Sons).- 

d) Fragments of ceg Sequences 

The invention further provides nucleic acid molecules having fragments of the ceg 
sequences, such as a portion of the ceg sequence (e.g., SEQ ID NOS:l-113, 227-331) 
disclosed herein. The size of the fragment will be determined by its intended use. For 
example, the length of the fragment to be used as a nucleic acid probe or PCR primer is 
chosen to obtain a relatively small number of false positives during probing or priming. 
Alternatively, a fragment of the ceg sequence may be used to construct a recombinant fusion 
gene having a ceg sequence fused to a non-ceg sequence. 

The nucleic acid molecules, fragments thereof, and probes and primers of the present 
invention are useful for a variety of molecular biology techniques including, for example, 
hybridization screens of libraries, or detection and quantification of mRNA transcripts as 
a means for analysis of gene transcription and/or expression. Preferably, the probes and 
primers are DNA. A probe or primer length of at least 15 base pairs is suggested by 
theoretical and practical considerations (Wallace, B. and Miyada, G. 1987 
"Oligonucleotide Probes for the Screening of Recombinant DNA Libraries" in: Methods 
in Enzymology, 152:432-442, Academic Press). Other lengths of fragments, probes, or 
primers are possible and routine to determine. 

The probes and primers of this invention can be prepared by methods well known to 
those skilled in the art (Sambrook, et'al. supra). In a preferred embodiment the probes 
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and primers are synthesized by chemical synthesis methods (ed: Gait, M. J. 1984 
Oligonucleotide Synthesis, IRL Press, Oxford, England). 

One embodiment of the present invention provides nucleic acid primers that are 
complementary to ceg sequences, which allow the specific amplification of nucleic acid 
molecules of the invention or of any specific parts thereof. Another embodiment 
provides nucleic acid probes that are complementary for selectively or specifically 
hybridizing to the ceg sequences or to any part thereof. 

e) Derivative Nucleic Acid Molecules 

The nucleic acid molecules of the invention include peptide nucleic acids (PNAs), or 
derivative molecules such as phosphorothioate, phosphotriester, phosphoramidate, and 
methylphosphonate, that specifically bind to single-stranded DNA or RNA in a base pair- 
dependent manner (Zamecnik, P. C. 3 et al., 1978 Proc. Natl. Acad Set 75:280284; 
Goodchild, P. C, et al., 1986 Proc. Natl Acad Set 83:4143-4146). 

PNA molecules comprise a nucleic acid oligomer to which an amino acid residue, such as 
lysine, and an amino group have been added. These small molecules, also designated 
anti-gene agents, stop transcript elongation by binding to their complementary (template) 
strand of nucleic acid (Nielsen, P. E., et al., 1993 Anticancer Drug Des 8:53-63). For 
example, reviews of methods for synthesis of DNA, RNA, and their analogues can be 
found in: Oligonucleotides and Analogues, eds. F. Eckstein, 1991, IRL Press, New York; 
Oligonucleotide Synthesis, ed. M. J. Gait, 1984, IRL Press, Oxford, England. 
Additionally, methods for antisense RNA technology are described in U. S. patents 
5,194,428 and 5,1 10,802. A skilled artisan can readily obtain these classes of nucleic acid 
molecules using the herein described ceg polynucleotide sequences, see for example 
Innovative and Perspectives in Solid Phase Synthesis (1992) Egholm, et al. pp 325-328 or 
U.S. Patent No. 5,539,082. 
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f) RNA Molecules 

The present invention provides RNA molecules that encode the predicted ceg gene 
products. In particular, the RNA molecules of the invention may be isolated full-length 
or partial mRNA molecules or RNA oligomers that encode CEG gene products. The 
RNA molecules of the invention include the nucleotide sequences encoding all or 
portions of CEGs. 

The RNA molecules of the invention also include antisense RNA molecules, peptide 
nucleic acids (PNAs), or non-nucleic acid molecules such as phosphorothioate 
derivatives, that specifically bind to the sense strand of DNA or RNA in a base pair- 
dependent manner. A skilled artisan can readily obtain these classes of nucleic acid 
molecules using the herein described ceg sequences. 

g) Labeled Nucleic Acid Molecules 

The nucleic acid molecules having ceg sequences can be labeled with a detectable 
marker. Examples of a detectable marker include, but are not limited to, a radioisotope, a 
' fluorescent compound, a bioluminescent compound, a chemiluminescent compound, a 
metal chelator or an enzyme. Technologies for generating labeled DNA and RNA probes 
are well known in the art (See e.g. Sambrook et al., supra). 

2.) RECOMBINANT NUCLEIC ACID MOLECULES 

Also provided are recombinant nucleic acid molecules, such as recombinant DNA molecules 
(rDNAs) that comprise ceg sequences or Segments thereof. As used herein, a recombinant 
DNA molecule is a DNA molecule that has been subjected to molecular manipulation in vitro. 
Methods for generating rDNA molecules are well known in the art, for example, see Sambrook 
et al., Molecular Cloning (1 989), supra. 
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a) Vectors 

The nucleic acid molecules of the invention may be recombinant molecules each 
comprising the sequence, or portions thereof, of a ceg sequence linked to a non-ceg 
5 sequence. For example, the ceg sequence may be fused operatively to a vector to 
generate a recombinant molecule. The term vector includes, but is not limited to, 
plasmids, cosmids, and phagemids. A preferred vector includes an autonomously 
replicating vector comprising a replicon that directs the replication of the rDNA within the 
appropriate host cell. The preferred vectors can also include an expression control 

10 element, such as a promoter sequence, which enables transcription of the inserted ceg 
sequences and can be used for regulating the expression (e.g., transcription and/or 
translation), of an operably linked ceg sequence in an appropriate host cell such as 
Escherichia coll Expression control elements are known in the art and include, but are not 
limited to, inducible promoters, constitutive promoters, secretion signals, enhancers, 

15 transcription terminators, and other transcriptional regulatory elements. Other expression 
control elements that are involved in translation are known in the art, and include the Shine- 
Dalgamo sequence, and initiation and termination codons. The preferred vector also 
includes at least one selectable marker gene that encodes a gene product that confers drug 
resistance such as resistance to ampicillin or tetracyline. The vector also comprises 

20 multiple endonuclease restriction sites that enable convenient insertion of .exogenous 
DNA sequences. 

The preferred vectors for generating ceg transcripts and/or the encoded CEG polypeptides 
are expression vectors which are compatible with prokaryotic host cells. Prokaryotic cell 
25 expression vectors are well known in the art and are available from several commercial 
sources. For example, a pET vectors (e.g., pET-21, Novagen Corp.) may be used to 
express CEG polypeptides in bacterial host cells. 
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b) Recombinant Vectors for Integration 

The present invention provides recombinant vectors that may be used to integrate 
exogenously provided sequences into the genome of a host cell. The recombinant 
integration vectors of the present invention include a gene that encodes a selectable 
marker and ceg sequences, or fragments thereof. The integration vectors are used to 
integrate the ceg sequence into a target gene sequence that resides within the bacterial 
host genome (e.g., endogenous sequence), thereby disrupting the function of the target 
gene sequence within the bacterial cells. These integration vectors may be used in a gene 
disruption assay to screen candidate ceg nucleotide sequences, in order to identify the 
candidate sequences that encode a gene product that is required for bacterial cell viability. 

Accordingly, these recombinant integration vectors include candidate ceg sequences that 
will be screened to determine if the candidate ceg sequences encode a gene product that 
is required for cell viability. The candidate ceg sequence that is included as part of the 
recombinant integration vector is the "exogenous" ceg sequence that is employed as the 
"disrupting" sequence in a gene disruption assay. The ceg sequence that resides within 
the host genome is the "endogenous" or "target" ceg sequence. 

The integration event rarely occurs, for example, by non-homologous recombination in 
which a recombinant vector, that includes the exogenous ceg sequence, inserts the 
exogenous ceg sequence into a random location within the host genome. In a more 
preferred embodiment, the integration event inserts the exogenous ceg sequence into a 
specific target site within the host genome. The targeted integration event can involve 
homologous recombination in which the integration vector, that includes the exogenous 
ceg sequence, inserts the exogenous ceg sequence into its homologous target ceg 
sequence that resides within the host's genome (e.g., the endogenous ceg sequence) 
(Figure 1). Further, the exogenous ceg sequence can be used as a disrupting sequence 
whereby the homologous recombination event integrates the exogenous ceg sequence 
into the endogenous target ceg sequence resulting in disruption of the function of the 
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endogenous ceg sequence. For example, disrupting the function of the endogenous ceg 
sequence may result in the loss of bacterial cell viability. 

An example of a recombinant vector that can be used as an integration vector in S. 
5 pneumoniae is the pEVP-3 vector (Jean-Pierre Claverys, et al. 1995 Gene 164: 123-128). 
The pEVP-3 vector integrates an exogenous sequence by homologous recombination 
involving a Campbell-type event (S. Adhya and A. Campbell 1970 J. Mol Biol. 50:481- 
490). The pEVP-3 vector includes a replicon that functions only in gram-negative 
bacteria, such as E. coll Therefore, the pEVP-3 vector cannot replicate in S. 
10 . pneumoniae. This vector also contains multiple cloning sites, and confers resistance to 
chloramphenicol in both a gram-negative and gram-positive bacteria, such as S. 
pneumoniae. 

c) Fusion Gene Sequences 

15' 

A fusion ceg gene is another example of a recombinant molecule of the invention. A fusion 
gene includes a ceg sequence operatively fused (e;g., linked) to a non-ceg sequence such as, 
for example, a tag sequence to facilitate isolation and/or purification of the expressed 
CEG gene product (Kroll, D.J., et al., 1993 DNA Cell Biol 12:441-53). 

20 

Alternatively, a recombinant fusion molecule has a ceg sequence of the invention fused to 
a ceg sequence isolated from a different microbial source. For example, the disclosed ceg 
sequences isolated from S. pneumoniae can be fused to a ceg sequence isolated from a 
different bacterial species. 

25 

3.) CEG PROTEINS AND POLYPEPTIDE MOLECULES 

The invention additionally provides CEG proteins and peptide fragments thereof that are 
isolated or substantially purified. Embodiments of particular CEG amino acid sequences 
30 are disclosed in Tables I and H (SEQ ID NOS:114-226 and SEQ ID NOS:332-436, 
respectively). 
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The present invention also includes polypeptides having sequence variations from the 
predicted CEG polypeptide sequences disclosed herein, including mutant variants, 
conservative substitution variants, and similar CEG polypeptides from other prokaryotic 
organisms. For convenience, such proteins are referred to herein as "CEG proteins", 
"CEG polypeptides", or "proteins of the invention". 

As used herein, CEG protein refers to a polypeptide having amino acid sequence identity or 
similarity to any one of the predicted amino acid sequences, as provided in SEQ ID NO.: 
1 14-226 or 332-436. The variant CEG polypeptides can be allelic forms of CEG, such as 
mutant forms of CEG polypeptides. The present invention also provides conservative 
substitution-mutants of the CEG proteins that maintain functional activity of wild-type CEG 
(e.g., the CEG polypeptide is required for bacterial cell viability). 

The CEG protein may be isolated from any source whether natural, synthetic, semi- 
synthetic, or recombinant. As used herein, "natural" refers to a polypeptide which is 
found in nature. Accordingly, the CEG proteins may be isolated from a prokaryotic 
organism, such as a bacterial strain including, but not limited to, Streptococcus, 
Escherichia, Bacillus, Pseudomonas, Yersinia, Salmonella, and Streptomyces. The CEG 
proteins of the invention, and fragments thereof, can also be generated by recombinant 
methods or chemical synthesis methods. 

The CEG polypeptides of the invention are essential for the viability of a bacterial cell. 
Further, the CEG polypeptides can exhibit at least any one of the following functions: a 
pantothenate kinase, a Holliday Junction branch migration protein, a single stranded 
DNA binding protein, a phosphoglucosamine mutase, an acetyltransferase, an 
uridylyltrarisferase, a malonyl CoenzymeA:ACP transcylase, a 3-oxoacyl-ACP synthase 
II, a 3-oxoacyl-ACP reductase, a phosphomethylpyrimidine (HMP-P) kinase, a GTP 
binding protein, a ATP binding protein, or a 4-aminoimidazole carboxylase. Putative 
functions can include, but are not limited to, sugar transferase, techoic acid biosynthesis, 
ribosome recycling factor, response regulator, nicotinate phosphoribosyltransferase, 
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nitropropane dioxygenase, (3R)-hydroxymyristol acyl carrier protein dehydrase, sugar 
dehydrogenase, murein biosynthesis, cobalimin biosynthesis, ABC transporter, tRNA 
modification enzyme, arylsulfatase, 16S processing enzyme, tRNA methyl transferase, 
elongation factor P, signal recognition particle, protein export, undecaprenol kinase, SRP 
docking domain, diacyl glycerol kinase, dihydopicilinate reductase, HU-DNA binding 
protein, thiamine biosynthase, GreA transcription elongation factor, dlDP-L-rhamnose 
synthase, ATP-binding motif, ribose-5-p-3-epimerase-like activity, GTP 
pyrophosphokinase, acetyl-CoA carboxylase, O-sialoglycoprotein endopeptidase, 
glucosamine-fhictose-6-phosphase aminotransferase, Strpn adhesion-associated ABC- 
permease, GTP pyrophosphokinase RelA, IMP dehydrogenase, DNA gyrase subunit B, 
acetyl-CoA carboxylase subunit AccD, phosphoglycerol kinase, acetyl-CoA carboxylase 
carbonyl transferase,, phosphopanthetheine adenylyltransferase, oligopeptide transport 
permease subunit, translocation protein, perM permease, DNA pol HI gamma and tau 
subunits, DNA pol HI delta subunit, signal peptidase I, acetyl-coA carboxylase biotin 
carboxyl carrier protein, protein chain release factor- 1, replicative DNA helicase, 
topoisomerase, pentapeptide-transferase, elongation factor G, spore coat polysaccharide 
biosynthesis protein C, protein release factor B, DNA polymerase III alpha subunit, 
phosphoprotein phosphatase, chaparonin, UDP-N-acetylmuramoylalanyl-D-glutamate-2, 
6-diaminopimelate ligase, techuronic acid biosynthesis, UDP-glucose lipid carrier 
transferase, transcription termination factor, chromosome segregation factor, amino acid 
biosynthesis, HMG-CoA reductase, hypoxanthine-guanine phosphoribosyltransferase. 

a) MODULATORS OF CEG POLYPEPTIDES 

The invention provides compounds that modulate (e.g., activate or inhibit) the function of 
a CEG polypeptide. Such compounds can provide lead-compounds for developing drugs 
for diagnosing and/or treating conditions associated with bacterial infections. The 
modulator is a compound that may alter the function of the CEG polypeptide, such as 
activating or inhibiting the function of a CEG polypeptide. For example, the compound 
can act as agonist, antagonist, partial agonist, partial antagonist, cytotoxic agents, 
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inhibitors of cell proliferation, and cell proliferation-promoting agents. The activity of 
the compound may be known, unknown or partially known. 

Suitable ligands include, but are not limited to, diazalactones, Af-protected amino acid, 
azabicyclodiene, and alkaloids. 

* An example of a diazalactone is: 




An example of a ^-protected amino acid is: 




An example of an azabicyclodiene is: 
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5 B) METHODS FOR MAKING THE CEG PROTEINS AND POLYPEPTIDES 

Recombinant methods are preferred if a high yield is desired. Recombinant , methods 
involve expressing the cloned gene in a suitable host cell. For example, a host cell is 
introduced with an expression vector having the CEG sequence, then the host cell is 
10 cultured under conditions that permit in vivo production of the CEG protein. The 
recombinant vector can integrate the CEG sequence into the host genome. Alternatively, 
the CEG sequence can be maintained extra-chromosomally, as part of ah autonomously 
replicating vector. 

15 1. HOST-VECTOR SYSTEMS 

The invention further provides a host-vector system comprising the vector, plasmid, 
phagemid, or cosmid comprising a ceg nucleotide sequence, or a fragment thereof, 
introduced into a suitable host cell. The host-vector system can be used to produce the 

44 



WO 01/49721 



PCTYUSOO/35604 



CEG polypeptides encoded by the ceg nucleotide sequences. The host cell can be 
prokaryotic or eukaryotic. Examples of suitable prokaryotic host cells include bacteria 
strains from genera such as Escherichia, Bacillus, Pseudomonas, Streptococcus, and 
Streptomyces. Examples of suitable eukaryotic host cells include a yeast cell, a plant cell, 
5 or an animal cell, such as a mammalian cell. A preferred embodiment provides a host- 
vector system comprising the pET21 vector having a ceg sequence introduced into an E. 
coli XDE3 lysogen which is useful, for example for the production of the CEG protein, 
herein designated CFE polypeptides and CFE proteins. 

1 0 Introduction of the rDNA molecules of the present invention into an appropriate cell host is 
accomplished by well known methods that typically depend on the type of vector used and 
host system employed. For example, transformation of prokaryotic host cells by 
electroporation and salt treatment methods are typically employed, see for example, Cohen 
etal., 1972 Proc Acad Sci USA 69:2110; Maniatis,T., etal, 1989 Molecular Cloning, A 

15 Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY. 
Transformation of vertebrate cells with vectors containing rDNAs, electroporatibn, cationic 
lipid or salt treatment methods are typically employed, see, for example, Graham et ah, 
1973 Virol 52:456; Wigler etal., 1979 Proc Natl Acad Sci USA 76:1373-76. 

20 Successfully transformed cells, i.e., cells that contain a rDNA molecule of the present 
invention, can be identified by well known techniques. For example, cells resulting from 
the introduction of a rDNA of the present invention can be selected and cloned to produce 
single colonies. Cells from those colonies can be harvested, lysed and their DNA content 
examined for the presence of the rDNA using a method such as that described by Southern, 

25 JMol Biol (1975) 98:503, or Berent et al., Biotech (1985) 3:208, or the proteins produced 
from the cell assayed via a biochemical assay or immunological method. 

Procaryotes are generally used as host cells for cloning and producing the products of 
exogenous DNA sequences. For example, the Escherichia coli K12 BL21 (ADE3) 
30 (Novagen) is particularly useful for expression of foreign proteins. Other strains of E. 
coli, and bacilli such as Bacillus subtilis, Enterobacteriaceae such as Salmonella 
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typhimurium or Serratia marcescans, various Pseudomonas, Streptococcus, and 
Streptomyces species may also be employed as host cells in cloning and expressing the 
recombinant proteins of this invention. 

In general terms, the production of recombinant CEG proteins may involve using a 
host/vector system, or other methods may be used. The host/vector system may employ the 
following steps. 

A nucleic acid molecule is obtained that encodes a CEG protein or a fragment thereof, such 
as any one of the polynucleotides disclosed in SEQ ID NOs.: 1 - 1 1 3 or 227-33 1 . The CEG- 
encoding nucleic acid molecule is preferably inserted into an expression vector in operable 
linkage with suitable expression control sequences, to generate an expression vector 
including the CEG-encoding sequence. The expression vector is introduced into a suitable 
host, by standard transformation methods, and the resulting transformed host is cultured 
under conditions that allow the production of the CEG protein. For example, if expression 
of the CEG gene is under the control of an inducible promoter, then suitable growth 
conditions would include the appropriate inducer. The CEG protein (e.g., designated a 
CFE polypeptide or protein), so produced, is isolated from the growth medium or directly 
from the cells; recovery and purification of the protein may not be necessary in some 
instances where some impurities may be tolerated. A skilled artisan can readily adapt an 
appropriate host/expression system known in the art for use with CEG-encoding sequences 
to produce a CEG protein (Cohen, et al. , supra; Maniatis et al., supra). 

Host cells harboring the nucleic acids disclosed herein are also provided by the present 
invention. A preferred host is E. coli strain BL21(AJDE3) transfected or transformed with 
a vector comprising a nucleic acid of the present invention. The invention also provides a 
host cell capable of expressing the ceg sequences described herein. The preferred host 
cell is any strain of E. coli that can accommodate high level expression of an exogenously 
introduced gene. 
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The proteins of the present invention can also be made by chemical synthesis. The 
principles of solid phase chemical synthesis of polypeptides are well known in the art and 
may be found in general texts relating to this area (Dugas, H. and Penney, C. 1981 
Bioorganic Chemistry, pp 54-92, Springer- Verlag, New York). CEG polypeptides may 
be synthesized by solid-phase methodology utilizing an Applied Biosystems 430A 
peptide synthesizer (Applied Biosystems, Foster City, Calif.) and synthesis cycles 
supplied by Applied Biosystems. Protected amino acids, such as t-butoxycarbonyl- 
protected amino acids, and other reagents are commercially available from many 
chemical supply houses. 

The polypeptides of the invention exhibit properties of a CEG protein, such as, for 
example, the ability to elicit the generation of antibodies that specifically bind an epitope 
associated with CEG polypeptides. Accordingly, the CEG polypeptide, or any 
oligopeptide thereof, is capable of inducing a specific immune response in appropriate 
animals or cells and binding with specific antibodies. 

Q ANTIBODIES THAT RECOGNIZE AND BIND THE PROTEINS AND 
POLYPEPTIDES OF THE INVENTION 

The invention further provides antibodies (e.g., polyclonal, monoclonal, chimeric, 
humanized, and human antibodies) that bind a CEG polypeptide. The most preferred 
antibodies will selectively bind a CEG polypeptide and will not bind (or will bind weakly) a 
non-CEG polypeptide. Antibodies that are particularly contemplated include monoclonal 
and polyclonal antibodies, as well as fragments thereof (e.g., recombinant proteins) which 
include the antigen binding domain and/or one or more complement detennining regions of 
these antibodies. These antibodies can be from any source, for example, rabbit, sheep, rat, 
dog, cat, pig, horse, mouse, and human. 

The invention encompasses antibody fragments that specifically recognize a CEG 
polypeptide. As used herein, an antibody fragment is defined as at least a portion of the 
variable region of the immunoglobulin molecule that binds to its target, i.e., the antigen 
binding region. Some of the constant region of the immunoglobulin may be included. 
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As will be understood by those skilled in the art, the regions or epitopes of a CEG 
polypeptide to which an antibody is directed may vary with the intended application. For 
example, antibodies intended for use in an immunoassay for the detection of membrane- 
bound CEG proteins on viable bacterial cells should be directed to an accessible epitope 
on membrane-bound CEG proteins. Antibodies that recognize other epitopes may be 
useful for the identification of CEG protein within damaged or dying cells, for the 
detection of secreted CEG protein or fragments thereof. 

Various methods for the preparation of antibodies are well known in the art. For example, 
antibodies may be prepared by immunizing a suitable mammalian host using a CEG protein, 
peptide, or fragment, in isolated or immunoconjugated form (Harlow, 1 989 Antibodies, Cold 
Spring Harbor Press, NY). In addition, fusion proteins comprising CEG polypeptides may 
also be used, such as a CEG protein/GST-fusion protein. Cells expressing or overexpressing 
a CEG polypeptide may also be used for immunizations. Similarly, any cell engineered to 
express CEG protein may be used. This strategy may result in the production of monoclonal 
antibodies with enhanced capacities for recognizing endogenous CEG protein. 

The present invention contemplates chimeric antibodies that comprise a human and non- 
human immunoglobin portion. The antigen combining region (variable region) of a 
chimeric antibody can be derived from a prokaryotic source (e.g., bacteria) and the 
constant region of the chimeric antibody which confers biological effector function to the 
immunoglobulin can be derived from a eukaryotic source (e.g., human). The chimeric 
antibody should have the antigen binding specificity of the prokaryotic antibody 
molecule and the effector function conferred by the eukaryotic antibody molecule. 

In one example, the procedure used to produce chimeric antibodies can involve the 
following steps: 

a) Identifying and cloning the correct immunoglobin gene segment encoding the 
antigen binding portion of the antibody molecule. This gene segment is known as 
the VDJ, variable, diversity and joining regions for heavy chains or VJ, variable, 
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joining regions for light chains or simply as the V or variable region. This gene 
regions may be in either the cDNA or genomic form; 

b) Cloning the gene segments encoding the constant region or desired part thereof; 

c) Ligating the variable region with the constant region so that the complete chimeric 
antibody is encoded in a form that can be transcribed and translated; 

d) Ligating this construct into a vector containing a selectable marker and gene control 
regions such as promoters, enhancers and poly(A) addition signals; 

e) Amplifying this construct in bacteria; 

f) Introducing this DNA into eukaryotic cells (transfection) most often mammalian 
lymphocytes; 

g) Selecting for cells expressing the selectable marker; 

h) Screening for cells expressing the desired chimeric antibody; and 

k) Testing the antibody for appropriate binding specificity and effector functions. 

Chimeric antibodies of several distinct antigen binding specificities have been produced 
by protocols well known in the art, including anti-TNP antibodies (Boulianne et al., 1984 
Nature 312:643); and anti-tumor antigen antibodies (Sahagan et al., 1986 J, Immunol 
137:1066). Likewise, several different effector functions have been achieved by linking 
. new sequences to those encoding the antigen binding region. Examples of these include 
enzymes (Neuberger et al., 1984 Nature 312:604); immunoglobulin constant regions 
from another species and constant regions of another immunoglobulin chain (Sharon et 
al., 19S4 Nature 309:364; Tan et al., 1985 J. Immunol. 135:3565-3567). Additionally, 
procedures for modifying antibody molecules and for producing chimeric antibody 
molecules using homologous recombination to target gene modification have been 
described (Fell et al., 1989 Proc. Natl Acad Sci USA 86:8507-8511). 

The predicted amino acid sequence of a CEG protein may be used to select specific regions 
of the CEG protein for generating antibodies. For example, hydrophobicity and 
hydrophilicity analyses of a CEG polypeptide may be used to identify hydrophobic and 
hydrophilic regions in the CEG protein. Regions of the CEG protein that show 
immunogenic structure, as well as other regions and domains, can readily be identified using 
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various other methods known in the art, such as Chou-Fasman, Garnier-Robson , Kyte- 
Doolittle, Eisenberg, Kaiplus-Schult or Jameson- Wolf analysis. Fragments that include the 
immunogenic regions are particularly suited for generating specific classes of antibodies. 

5 Methods for preparing a protein for use as an immunogen and for preparing immunogenic 
conjugates of a protein with a cairier such as BSA, KLH, or other carrier proteins are well 
known in the art. In some circumstances, direct conjugation using, for example, 
carbodiimide reagents may be used; in other instances linking reagents such as those 
supplied by Pierce Chemical Co., Rockford, IL, may be effective. Administration of a CEG 
1 0 immunogen is conducted generally by injection over a suitable time period and with use of a 
suitable adjuvant, as is generally understood in the art. During the immunization schedule, 
titers of antibodies can be taken to determine adequacy of polyclonal antibody formation. 

While the polyclonal antisera produced in this way may be satisfactory for some 
15 applications, for pharmaceutical compositions, monoclonal antibody preparations are 
preferred. . Immortalized cell lines which secrete a desired monoclonal antibody may be 
prepared using the standard method of Kohler and Milstein {Nature 256: 495-497) or other 
techniques as described in Monoclonal Antibodies; A Manual of Techniques, CRC press, 
Inc., Boca Raton, Fla.(1987) ed. Zola. The immortalized cell lines secreting the desired 
20 antibodies are screened by immunoassay in which the antigen is the CEG polypeptide 
having binding activity, or a fragment thereof. When the appropriate immortalized cell 
culture secreting the desired antibody is identified, the cells can be cultured either in vitro or 
by production in ascites fluid. 

25 The desired monoclonal antibodies are then recovered from the culture supernatant or from 
the ascites supernatant. Fragments of the monoclonal antibodies of the invention or the 
polyclonal antisera (e.g., Fab, F(ab'>2, Fv fragments, fusion proteins) which contain the 
immunologically significant portion (i.e., a portion that recognizes and binds a CEG protein) 
can be used as antagonists, as well as the intact antibodies. Humanized antibodies directed 

30 against a CEG polypeptide are also useful. The advantage of using humanized antibodies is 
that they are less immunogenic in humans. As used herein, a humanized antibody is an 
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immunoglobulin molecule which is capable of binding to a CEG polypeptide and which 
comprises a FR region having substantially the amino acid sequence of a human 
immunoglobulin and a CDR having substantially the amino acid sequence of non-Human 
immunoglobulin or a sequence engineered to bind a CEG protein. Methods for humanizing 
murine and other non-human antibodies by substituting one or more of the non-human 
antibody CDRs for corresponding human antibody sequences are well known (Jones et al., 
1986 Nature 321 : 522-525; Riechmnan et al., 1988 Nature 332: 323-327; Verhoeyen et al., 
1988 Science 239: 1534-1536; Carter et al, 1993 Proa Natl Acad ScL USA 89: 4285; 
and Sims et al., 1993 J. Immunol 151: 2296). 

Use of immunologically reactive fragments, such as the Fab, Fab 1 , or F(ab^2 fragments is 
often preferable, especially in a therapeutic context, as these fragments are generally less 
immunogenic than the whole immunoglobulin. Further, bi-specific antibodies specific for 
two or more epitopes may be generated using methods generally known in the art. Further, 
antibody effector functions may be modified so as to enhance the therapeutic effect of the 
antibodies of the invention. For example, cysteine residues may be engineered into the Fc 
region, permitting the formation of interchain disulfide bonds and the generation of 
homodimers which may have enhanced capacities for internalization, ADCC and/or 
complement-mediated cell killing (Caron et al., 1992 J. Exp. Med 176: 1191-1195; 
Shopes, 1992 J. Immunol 148: 2918-2922). Homodimeric antibodies may also be 
generated by cross-linking techniques known in the art (Wolff et al. Cancer Res, 53: 2560- 
2565). The invention also provides pharmaceutical compositions having the monoclonal 
antibodies or anti-idiotypic monoclonal antibodies of the invention. 

The antibodies or fragments may also be produced, using current technology, by 
recombinant means. Regions that bind specifically to the desired regions of the CEG 
protein can also be produced in the context of chimeric or CDR grafted antibodies of 
multiple species origin. The invention includes an antibody, e.g., a monoclonal antibody 
which competitively inhibits the immunospecific binding of any of the monoclonal 
antibodies of the invention to a CEG protein. 
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Alternatively, methods for producing fUUy human monoclonal antibodies, include phage 
display and transgenic methods, are known and may be used for the generation of human 
monoclonal antibodies (reviewed in: Vaughan et al., 1998 Nature Biotechnology 16: 535- 
539). For example, fully human monoclonal antibodies may be generated using cloning 
technologies employing large human Ig gene combinatorial libraries (i.e., phage display) 
(Griffiths and Hoogenboom, "Building an in vitro immune system: human antibodies from 
phage display libraries", in: Protein Engineering of Antibody Molecules for Prophylactic 
and Therapeutic Applications in Man, Clark, M. (Ed.), Nottingham Academic, pp 45-64 
(1993); Burton and Barbas, "Human Antibodies from combinatorial libraries" Id., pp 65- 
82). Fully human monoclonal antibodies may also be produced using transgenic mice 
engineered to contain human immunoglobulin gene loci as described in PCT Patent 
Application W098/24893, Jakobovits et al., published December 3, 1997 (see also, 
Jakobovits, 1998 Exp. Opin Invest. Drugs 7: 607-614). This method avoids the in vitro 
manipulation required with phage display technology and efficiently produces high affinity, 
authentic human antibodies. 

The antibody or fragment thereof of the invention may be labeled with a detectable 
marker or conjugated to a second molecule, such as a therapeutic agent (e.g., a cytotoxic 
agent) thereby resulting in an immunoconjugate. For example, the therapeutic agent 
includes, but is not limited to, an anti-tumor drug, a toxin, a radioactive agent, a cytokine, 
a second antibody or an enzyme. Further, the invention provides an embodiment wherein 
the antibody of the invention is linked to an enzyme that converts a prodrug into a 
cytotoxic drug. 

Examples of cytotoxic agents include, but are not limited to ricin, ricin A-chain, 
doxorubicin, daunorubicin, taxol, ethiduim bromide, mitomycin, etoposide, tenoposide, 
vincristine, vinblastine, colchicine, dihydroxy anthracin dione, actinomycin D, diphteria 
toxin, Pseudomonas exotoxin (PE) A, PE40, abrin, arbrin A chain, modeccin A chain, 
alpha-sarcin, gelonin, mitogellin, retstrictocin, phenomycin, enomycin, curicin, crotin, 
calicheamicin, sapaonaria officinalis inhibitor, and glucocorticoid and other 
chemotherapeutic agents, as well as radioisotopes such as 212 Bi, 13, 1, 13I In, 90 Y, and 186 Re. 
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Suitable detectable markers for diagnostic used include, but are not limited to, a 
radioisotope, a fluorescent compound, a bioluminescent compound, chemiluminescent 
compound, a metal chelator or an enzyme. Antibodies may also be conjugated to an anti- 
cancer pro-drug activating enzyme capable of converting the pro-drug to its active form. 
See, for example, U.S. Patent Nos. 4,952,394 and 5,716,990. 

Additionally, a recombinant protein of the invention comprising the antigen-binding 
region of any of the monoclonal antibodies of the invention can be made. In such a 
situation, the antigen-binding region of the recombinant protein is joined to at least a 
functionally active portion of a second protein having therapeutic activity. The second 
protein can include, but is not limited to, an enzyme, lymphokine, oncostatin or toxin. 
Suitable toxins include those described above. 

Techniques for conjugating or joining therapeutic agents to antibodies are well known 
(Axnon et al., "Monoclonal Antibodies For Immunotargeting Of Drugs In Cancer Therapy", 
in: Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. (eds.), pp. 243-56, Alan R. 
Liss, Inc. 1985; Hellstrom et al., "Antibodies For Drug Delivery", in: Controlled Drug 
Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53, Marcel Dekker, Inc. 1987; Thorpe, 
"Antibody Carriers Of Cytotoxic Agents In Cancer Therapy: A Review", in: Monoclonal 
Antibodies f 84: Biological And Clinical Applications, Pinchera et al. (eds.), pp. 475-506 
(1985); and Thorpe et al., "The Preparation And Cytotoxic Properties Of Antibody-Toxin 
Conjugates", in: Immunol Rev., 62:119-58 (1982)). Techniques for joining detectable 
markers to antibodies are also known. 

D) PHARMACEUTICAL COMPOSITIONS OF THE INVENTION 

The invention includes pharmaceutical compositions for use in the treatment of microbial 
infections comprising a pharmaceutically effective amount of an anti-CEG antibody or a 
CEG polypeptide. 
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In one embodiment, the pharmaceutical compositions may comprise a CEG antibody, 
either unmodified, conjugated to a therapeutic agent (e.g., drug, toxin, enzyme or second 
antibody) or in a recombinant form (e.g., chimeric or bispecific). The compositions may 
additionally include other antibodies or conjugates (e.g., an antibody cocktail). 

5 

The pharmaceutical compositions also preferably include suitable carriers and adjuvants 
which include any material which when combined with the molecule of the invention 
(e.g., an anti-CEG antibody or a CEG protein) retains the molecule's activity and is non- 
reactive with the subject's immune systems. Examples of suitable carriers and adjuvants 

10 include, but are not limited to, human serum albumin, ion exchangers, alumina, lecithin, 
buffer substances such as phosphates, glycine, sorbic acid, potassium sorbate, and salts or 
electrolytes such as protamine sulfate. Other examples include any of the standard 
pharmaceutical carriers such as a phosphate buffered saline solution, water, emulsions 
such as oil/water emulsion, and various types of wetting agents. Other carriers may also 

15 include sterile solutions, tablets including coated tablets and. capsules. Typically such 
carriers contain excipients such as starch, milk, sugar, certain types of clay, gelatin, 
stearic acid or salts thereof, magnesium or calcium stearate, talc, vegetable fats or oils, 
gums, glycols, or other known excipients. Such carriers may also include flavor and 
color additives or other ingredients. Compositions comprising such carriers are 

20 formulated by well known conventional methods. Such compositions may also be 
formulated within various lipid compositions, such as, for example, liposomes as well as 
in various polymeric compositions, such as polymer microspheres. 

The pharmaceutical compositions of the invention can be administered using 
25 conventional modes of administration including, but not limited to, intravenous, 
intraperitoneal, oral, ijitralymphatic or administration directly into the tumor. 
Intravenous administration is preferred. 

The pharmaceutical compositions of the invention may be in a variety of dosage forms 
30 which include, but are not limited to, liquid solutions or suspensions, tablets, pills, 
powders, suppositories, polymeric microcapsules or microvesicles, liposomes, and 
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injectable or infusible solutions. The preferred form depends upon the mode of 
administration and the therapeutic application. 

The CEG polypeptides and proteins of this invention are found in common pathogenic 
5 bacterial species such as Streptococcus pneumoniae. This organism causes upper 
respiratory tract infections. Thus, the peptides and proteins of this invention can be used 
as immunogensin subunit vaccines for vaccination against a pathogenic bacteria such as 
Streptococcus pneumoniae. Additionally, the ceg sequences of the invention can be used 
as DNA vaccines (U.S. Patent No. 5,736,524 and U.S. Patent No. 5,989,553). 

10 

The polypeptides and proteins of this invention can be formulated . as univalent and 
multivalent vaccines. The protein can be mixed, conjugated or fused with other antigens, 
including B or T cell epitopes of other antigens. 

15 Further, when a haptenic peptide of the proteins of the invention is used, (i.e., a peptide 
which reacts with cognate antibodies, but cannot itself elicit an immune response), it can 
be conjugated to an immunogenic carrier molecule. Conjugation to an immunogenic 
carrier can render the oligopeptide immunogenic. Examples of carrier molecules are 
tetanus toxin or toxoid, diphtheria toxin or toxoid and any mutant forms of these proteins 

20 such as CRM.sub!l97. Others include exotoxin A of Pseudomonas, the heat labile toxin 
of E. coli and rotaviral particles (including rotavirus and VP6 particles). Alternatively, a 
fragment or epitope of the carrier protein or other immunogenic protein can be used. For 
example, the happen can be coupled to a T cell epitope of a bacterial toxin. 

25 In formulating the vaccine compositions with the CEG polypeptides or proteins of the 
invention, alone or in the various combinations described, the immunogen is adjusted to 
an appropriate concentration and formulated with any suitable vaccine adjuvant. Suitable 
adjuvants include, but are not limited to: surface active substances, e.g., hexadecylamine, 
octadecylamine, . octadecyl amino acid esters, lysolecithin, dimethyl- 

30 dioctadecylammonium bromide), methoxyhexadecylgylcerol, and pluronic polyols; 
polyamines, e.g., pyran, dextransulfate, poly. IC, carbopol; peptides, e.g., muramyl 
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dipeptide, dimethylglycine, tuftsin; oil emulsions; and mineral gels, e.g., aluminum 
hydroxide, aluminum phosphate, etc. and immune stimulating complexes. The 
immunogen may also be incorporated into liposomes, or conjugated to polysaccharides 
and/or other polymers. 

5 

The vaccines can be administered to a human or animal in a variety of ways. These 
include intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous, oral and 
intranasal routes of administration. Further, the vaccines can be live or inactivated 
vaccines. 

10 

The most effective mode of administration and dosage regimen for the compositions of 
this invention depends upon the severity and course of the disease, the patient's health 
and response to treatment and the judgment of the treating physician. Accordingly, the 
dosages of the compositions should be titrated to the individual patient. 

15 

E) USES OF THE MOLECULES OF THE INVENTION 

1) MOLECULAR WEIGHT MARKERS 

20 The nucleic acid molecules of the invention and their encoded proteins may be employed 
as molecular weight markers. For example, the molecular weight of each of the nucleic 
acid molecules having ceg sequences and their predicted polypeptides can be determined 
and can be used to compare against other gene sequences and proteins whose molecular 
weights are unknown. 

25 

2) DIAGNOSTICS 

The nucleic acid molecules of the invention may be employed in diagnostic 
embodiments. For example, the presence of nucleotide sequences which are identical or 
30 similar to the ceg sequences of the invention may be detected within a biological sample. 
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The biological sample may include blood, serum or a swab from nose, ear or throat, may 
be determined by means of a nucleic acid detection assay. 

Nucleic acid probes or primers having sequences complementary to ceg sequences may 
be used in a hybridization assay to detect the presence of the sequences which are 
identical or similar to the ceg sequences of the invention in the biological samples. 
Typically, nucleic acids molecules obtained from a suitable biological sample are 
hybridized with labeled probes or primers. The resulting hybridized molecules are 
detected and resolved by methods well known in the art , such as Northern or Southern 
blotting, micro-array technology, or amplifying with PCR technology. Other 
hybridization techniques and systems are known that can be used in connection with the 
detection aspects of the invention, including diagnostic assays such as those described in 
Falkow et al., U.S. Pat. No. 4,358,535. 

. Examples of the PCR technology are disclosed in U.S. Patent Nos. 4,683,202 and 
4,965,188 (incorporated herein by reference). Generally, nucleic acid molecules are 
obtained from a suitable biological source and contacted with two primers corresponding 
to the ceg sequences disclosed herein, under conditions which allow for hybridization and 
polymerization to occur. A pair of probes, one corresponding to the 5 ! flanking region 
and the other corresponding to the 3' flanking region, would be sufficient to detect the 
nucleic acid molecules of the invention in a biological sample and may be used to 
indicate the amount of bacteria present. 

Alternative methods of detecting nucleic acid molecules include, for example, in situ 
hybridization techniques, where a ceg probe is used to detect homologous sequences 
within one or more cells, such as cells within a clinical sample or even cells grown in 
tissue culture. As is well known in the art, the cells are prepared for hybridization by 
fixation, e.g. chemical fixation, and placed in conditions that allow for the hybridization 
of a detectable probe with nucleic acids located within the fixed cell. 
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The amount of ceg sequences present in a biological sample can be quantified and 
compared to the levels in a normal or "healthy" sample. For example, ceg sequences 
present in either increased or decreased levels, compared to the levels found in the 
control sample may indicate the presence of bacteria. This information is useful for 
5 * diagnosis of a bacterial infection that requires treatment with an antibacterial agent. 

Alternatively, the amount of CEG polypeptides present in a biological sample may be 
determined by means of an immunoassay. For example, labeled antibodies reactive 
against CEG polypeptides may be used in an immuno-reactive assay to detect the 
10 presence of CEG polypeptides in the biological samples. 

3) SCREENING CANDIDATE CEG SEQUENCES 

a) Gene Disruption Assay 

15 

The ceg nucleotide sequences of the invention can be used to identify nucleotide 
sequences which are identical or similar to the ceg sequences that are required for 
bacterial cell viability. For example, the ceg sequences can be used in a bacterial gene 
disruption assay to screen candidate nucleotide sequences to identify sequences required 
20 for bacterial cell viability. 

The disruption assay can involve: introducing into a host cell a recombinant vector that is 
capable of integration into the host genome, where the recombinant vector, includes a 
candidate sequence that putatively encodes a cell-viability gene product (e.g., the 
25 exogenous ceg sequence); the vector integrates the candidate sequence into a target 
sequence within the host's genome (e.g., the endogenous ceg sequence); and the host cell, 
so introduced, is screened for viability. The recombinant vector preferably includes a 
selectable marker so that the introduced host cell can be screened for viability in the 
presence of a selectable agent. 

30 
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For example, Figure 1 shows a schematic representation of a gene disruption assay, 
within a bacterial host cell. In Figure 1 A, the recombinant vector, pEVP3, includes the 
CAT gene (e.g., the selectable marker chloramphenicol acetyl transferase) and an internal 
region of the ceg disrupting sequence; the internal region excludes the 5' and 3' ends of 
the ceg sequence. The "X" in Figure 1 indicates the recombinant pEVP3 vector undergoing 
homologous recombination with the target sequence (e.g., within the host genome). In 
Figure IB, the resolved pEVP3 vector that is integrated into the host genome, is shown. 
Left to right are the following elements: the native promoter of the target gene; a 5' partial 
copy of the target gene; the body of the integrated pEVP3 vector including the disrupting 
gene and CAT; and, a 3' partial copy of the target gene. Thus, integration of the pEVP3 
vector via homologous recombination results in two partial gene duplications flanking the 
integrated vector. If the target gene is not essential for survival, it is possible to recover 
chloramphenicol-resistant colonies of £ pneumoniae. Failure to recover chloramphenicol 
resistant colonies, in the presence of the proper controls as described below, indicates that 
the target gene may be essential for cell viability. 

More particularly, the gene disruption assay for screening candidate ceg sequences can 
involve the following steps. The recombinant pEVP-3 . vector encoding CAT resistance 
and having a fragment of a candidate ceg sequence, can be introduced into 
transformation-competent S. pneumoniae cells by methods that are well-known in the art 
(Lee, M.S., et al., 1998 Appl Environ. Microbiol 64:4796-4802). The preferred size of 
the ceg fragment can be between about 200 to about 500 bp in length. It is advantageous 
that the candidate ceg sequence does not include the 5' and 3' ends that encode the Isl- 
and C-terminal ends of the CEG polypeptide. This insures that the inserted ceg fragment 
and the disrupted endogenous ceg gene sequence are not capable of expression of a full- 
length, functional ceg gene product. The transformation-competent cells can be obtained 
by performing the transformation step in the presence of a heptadecapeptide that induces 
competence for transformation of S. pneumoniae (Havarstein, L. S., et al., 1995 Proc. 
Natl Acad Sci. 92:11140-11144), such as the CSP-1 peptide. The CSP-1 can be 
naturally-derived or synthetic. Additionally, the transformation step can.be optimized by 
performing the transformation when the cells have reached a density which is optimal for 
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transformation (e.g., 3 X 10 7 cells per ml.) (Havarstein, L. S. et al. supra). The 
recombinant vector can be introduced into the competent pneumococci and may undergo 
homologous recombination, whereby the candidate ceg fragment recombines with the 
corresponding endogenous ceg sequence, resulting in targeted integration of the vector 
into the pneumococcal genome and disruption of the endogenous ceg. 

The transformed cells can be plated on or cultured in chloramphenicol-containing growth 
medium. The cells can be cultured under standard conditions, such as 37° C in 5% CO2 
for approximately 40 to 48 hours, for the purpose of selecting cells that carry the 
integrated vector. 

Additionally, control samples can be run in parallel with the gene disruption assay, in 
order to determine whether the gene disruption procedure is working properly. For 
example, the control samples can be used to calibrate the gene disruption experiment so 
that disruption of a known non-essential bacterial gene results in an approximate number 
of colonies per plate. Similarly, the disruption of a known essential gene can be 
calibrated to yield only zero or one colony per plate. The appearance of one colony is 
due to the rare illegitimate recombination into a non-homologous sequence. In particular, 
a known non-essential gene such as the lytA gene (Tomasz, A., et al., 1988 J Bacteriol 
170:5931-5934) can be used so that between about 70 to 100 chloramphenicol-resistant 
colonies will grow per plate. Similarly, the ftsZ gene (Lutkenhaus, J. F., et al., 1980 J. 
Bacteriol 143:1281-1288), a known essential gene, can be used to yield zero or, rarely, 
one colony per plate. As is well known in the art, specific parameters that are involved in 
any given gene disruption assay can be adjusted to calibrate the desired number of plated 
cells in the control samples. Experimental parameters that can be adjusted include, but 
are not limited to, the E. coli strain used to propagate the vector/insert, the fragment 
length of the sequence to be integrated, the amount of recombinant integration vector 
used to transform the cells, use of transformation-competent cells, and plating density of 
the transformed cells. 
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The transformed cells carrying the recombinant integration vector that disrupts 
expression of an endogenous essential gene (e.g., the target ceg gene) can be identified, 
based on a selectable phenotype such as non-viability. For example, the cells that carry a 
disrupted non-essential gene will be viable and, due to the integration of pEVP3, will 
grow on chloramphenicol-containing medium. In contrast, cells that carry a disrupted 
essential gene will not grow (e.g., non-viable) on the chloramphenicol-containing 
medium. Thus, the transformed cells that do not grow under these selective conditions 
carry an endogenous gene sequence that is essential for cell viability which has been 
disrupted by an exogenous candidate fragment, thereby identifying a ceg sequence. Steps 
one through three may be repeated in order to confirm that the ceg sequences, so 
identified, are essential for cell viability. 

b) Autolysin Assay 

It is advantageous to perform additional steps to determine whether the homologous 
recombination events result in disruption of the intended target gene sequence. The lytA 
transformation control can be used to confirm that the transformation system is 
functioning properly. For example, a phenotypic test for autolysin activity (lytA gene 
product) can be performed to determine that the exogenous lytA fragment is correctly 
integrated into the lytA site within the host genome. This typically involves flooding the 
culture plates containing transformants carrying the integrated lytA control vector with a 
solution of detergent, such as 0.1% deoxycholate, which triggers cell lysis in -intact 
cells (e.g., the cells that have not undergone homologous recombination). After about 5- 
10 minutes the colonies with intact lytA will appear ghost-like due to cell lysis, and the 
colonies with a disrupted lytA gene will appear intact. 

c) Polarity Analysis 

The ceg sequences that are confirmed to be essential for cell viability can be examined 
further by performing a polarity analysis to determine if the corresponding endogenous 
ceg sequence is organized in an operon. Polarity is an effect unique to prokaryotes and is 
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the result of the operon organization of bacterial genomes. Many bacterial genes are 
arranged in operons in which multiple genes are under the control of a single regulatory 
sequence (e.g., a promoter) and are transcribed into a single mRNA transcript. With 
respect to the orientation of multiple genes within an operon, the genes that are proximal 
to the regulatory sequence are said to be "upstream" genes and the genes that are distal 
are said to be "downstream" genes. For example, many operons contain genes encoding 
different proteins that catalyze discrete steps of a common biochemical pathway. Thus, 
any of the proteins that catalyze the steps of the pathway may be essential for cell 
viability. 

The presence of operons in a bacterial host genome may influence the interpretations of 
the gene disruption results. For example, disruption of an upstream gene may be 
erroneously interpreted as affecting the expression of the disrupted gene but may, in fact, 
have expression affects on the intact downstream genes. Therefore, it is advantageous to 
perform a polarity analysis to determine if a ceg sequence is part of an operon. 

A polarity analysis can involve performing an in vivo gene disruption procedure using^ as 
the disrupting sequence, a ceg sequence that includes the entire ceg coding sequence 
region but lacking expression regulatory sequences. This differs from the gene disruption 
assay, which involves the central region of the ceg sequence. The polarity analysis 
involves gene duplication via homologous recombination. For example, the pEVP-3 
vector having the entire coding region of a ceg sequence can be used for the polarity 
analysis (Figure 2 A). The polarity analysis will yield different results depending on the 
organization of the endogenous target sequence within the host genome. 

For example, Figure 2 shows a schematic representation of the polarity test for operons, 
^syithin a bacterial host cell. In Figure 2A, the recombinant vector, pEVP3, includes the 
CAT gene and the entire coding region of the ceg disrupting sequence. The "X" in Figure 
2 indicates the recombinant pEVP3 vector undergoing homologous recombination with the 
target sequence. Two of the possible results of homologous recombination are shown in 
Figures 2 B and C. 
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In Figure 2 B, case 1, if the endogenous target sequence is not organized in an operon, the 
integration event may yield: a functional target sequence (e.g., it is capable of 
expression); a duplicate non-functional target sequence that lacks a promoter; and a 
5 functional downstream gene (e.g., Gene B) that is controlled by its own promoter. The 
cells carrying this type of integrated target sequence can be recovered as viable cells that 
grow in the presence of chloramphenicol; this condition is termed "polarity negative". 

In Figure 2 C, case 2, if the target sequence is organized in an operon, then the integration 
10 event may yield an integration site that is similar to that described for case 1, including: a 
functional target sequence; and a duplicate non-functional target sequence which is not 
functional. However, this integration event may also yield a non-functional downstream 
gene (e.g., Gene B) because expression of this downstream gene is controlled by a 
promoter located upstream of the insertion site. The cells that carry this type of 
15 integrated target sequence will be non- viable; this condition is termed "polarity positive". 
Thus, the polarity analysis provides a method to determine whether integration of a 
. recombinant vector into a target ceg sequence effects expression of downstream genes. 

The ceg sequences disclosed herein (SEQ ID NOs.: 1-113, 227-331) encode gene 
20 products that are essential for viability in S. pneumoniae. Furthermore, many of these 
ceg sequences have been analyzed for the polarity effect and the results are presented in 
Table I. One subset of ceg sequences is classified as polarity negative (-), since the 
homologous recombination event did not effect the expression of downstream genes. 
Another subset of ceg sequences is classified as polarity positive (+), since the 
25 homologous recombination event did affect the expression of downstream genes. The 
ceg sequences that have not yet been classified as polarity positive or negative are 
indicated in Table I as a blank. For the ceg sequences that are classified as polarity 
positive, the genes downstream of the disrupted endogenous ceg sequences may or may 
riot also be essential. 

30 
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4) ASSAYS FOR IDENTIFYING CEG LIGANDS AND OTHER 



BINDING AGENTS 



The present invention provides screening methods for identifying agents that interact 
and/or bind to the CEG proteins of the invention, such as a ligand. An agent can be, for 
example, a natural product, a derived or synthetic chemical molecule, a polypeptide, a 
nucleic acid molecule, or a metal. The agents that interact with CEG proteins may cause 
bacterial cell death by disrupting the functions of CEG proteins, including, but not 
limited to, nucleotide biosynthesis, DNA replication, RNA transcription, protein 
translation, and/or cell wall biosynthesis, Accordingly, the present invention provides 
screening methods for identifying agents having antibacterial activity, such as agents that 
cause bacterial cell death by interacting with the CEG proteins. These antibacterial . 
agents are useful for treating diseases and afflictions associated with bacterial infections. 

Various methods can be used to discover agents having antibacterial activity, as 
determined by the ability of the binding agent to bind to a CEG protein and disrupt the 
function of the CEG protein. These screening methods include whole cell in vivo assays 
as well as in vitro assays with cellular components. 

An in vivo screening method for identifying ligands that bind CEG polypeptides can be 
performed in a whole cell assay. A typical method may be the use of whole bacterial 
cells to assess the antibacterial properties based on cell growth or viability. These 
methods can include methods for measuring cell growth and/or viability, for example, by 
optical density or zones of growth (Koch, A. L. et al., 1970 Anal. Biochem. 38:252-259; 
Biemer, J. J. et al., 1973 Ann, Clin Lab. Sci. 2:135-140; Manual of Clinical 
Microbiology; 7 th edition, Murray, P. R. (ed), ASM Press), by growth inhibition in an 
agar assay (Murray, P. R., supra), or other means of detecting cell metabolism 
(Mychajlonka, M. et al., 1980 Antimicrob. Agents Chemother. 17:572-582), and are well 
known to those skilled in the art. In addition, there are molecular biology-based detection 
methods for use with whole bacterial cells, such as gene reporter assays, to monitor the 
effect* of the ligand on specific targets (Slauch, J. M., et al., 1991 Methods EnzymoL 
204:213-248). Examples of the reporter genes include, but are riot limited to, beta- 
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galactosidase, alkaline phosphatase, luciferase, and green fluorescent protein. For 
example, one embodiment provides a reporter system that monitors inhibition of DNA 
synthesis by fusing a reporter such as beta-galactosidase (lacZ) to genes known to be 
upregulated by the cessation of DNA synthesis as a result of the binding of ligands to the 
DNA synthetic apparatus. (Shurvinton, C. E., et al., 1982 Mol Gen. Genetics 185:352- 
355; Rosato, A., et al., 1998 Antimicrob. Agents Chemother 42:1392-1396). 

Alternatively, the yeast two-hybrid system (Fields, S. and Song, O. 1989, Nature 
340:245-246) may be adapted to screen for ligands that bind CEG polypeptides. Generally, 
the yeast two-hybrid system is performed in a yeast host cell carrying a reporter gene, and 
is based on the modular nature of the GAL transcription factor which has a DNA binding 
domain and a transcriptional activation domain. The yeast two-hybrid system relies on 
the physical interaction between a recombinant polypeptide that comprises the GAL 
DNA binding domain and another recombinant polypeptide that comprises the GAL 
transcriptional activation domain. The physical interaction between the two recombinant 
polypeptides reconstitutes the transcriptional activity of the transcription factor, thereby 
causing expression of the reporter gene. Either of the recombinant polypeptides used in 
the two-hybrid system can be generated to include a CEG polypeptide sequence to screen 
for binding partners of CEG. 

Another method uses the bacterial CEG proteins as the basis for in vitro assay systems to 
detect binding agents. Typically, the in vitro screening method comprises: a) generating 
the CEG protein of the invention, or membranes enriched in the CEG protein; b) 
exposing the CEG protein or membranes to a candidate agent; and c) detecting the 
interaction of the CEG protein with the agent by any suitable means. Additionally, the 
screening methods may be adapted to automated high-throughput procedures, such as 
PANDEX.RTM Baxter-Dade Diagnostics, allowing for efficient high-volume screening 
of candidate agents. 

An alternative method for screening potential, ligands involves an in vitro binding 
procedure. Typically, the CEG proteins of the invention can be produced using 
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recombinant DNA technology and host-vector systems as described herein. A candidate 
agent is introduced into a reaction vessel containing the CEG protein, of fragment 
thereof; the candidate agents may be detectable by methods such as, but not limited to, 
radioisotope or chemical labeling. Binding of the CEG protein by a candidate agent can 
be determined by any suitable means, including, for example, quantifying bound label 
versus unbound label using any suitable method. Binding of a candidate agent may also 
be detected by methods similar to an alternative physical method disclosed in U.S. Patent 
No. 5,585,277. In this method, binding of a candidate agent to a protein is assessed by 
monitoring the ratio of folded protein to unfolded protein, for example by monitoring 
sensitivity of the protein to a protease, or amenability to binding of the protein by a 
specific antibody against the folded state of the protein, or binding to chaperone protein, 
or by binding to any suitable surface. 

The invention provides methods of identifying compounds that modulate (e.g., activate or 
inhibit) the function of a CEG polypeptide. Essentially any compound can be used in the 
assays of the invention. The preferred compounds are those that are soluble in aqueous 
or organic solutions. It will be appreciated by those of skill in the art that there are many 
commercial suppliers of chemical compounds that can be used in the methods of the 
invention, including Sigma Chemical Co. (St. Louis, Mo.), Aldrich Chemical Co. (St. 
Louis, Mo.), Sigma-Aldrich (St. Louis, Mo.), Fluka Chemika-Biochemica Analytika 
(Buchs, Switzerland), and the like. 

The present invention provides methods for detecting compounds which are identified as 
modulators of CEG function. The methods of the invention can be performed using 
isolated CEG polypeptides, or use whole cells expressing the CEG polypeptide. The 
steps, of the method using isolated CEG polypeptides include: contacting the isolated 
CEG polypeptide with a candidate compound; and determining whether the function of 
the CEG polypeptide is altered. The steps of the method using whole cells include: 
contacting the whole cells with a candidate compound; and determining whether the cell 
dies, indicating the compound inhibited the function of a CEG polypeptide. 
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The preferred methods of the invention provide high-throughput screening assays for 
identifying compounds which modulate the function of a CEG polypeptide. The high 
throughput methods permit screening of large libraries of compounds. For example the 
high throughput methods can use automated assay steps. The assays can be performed 
in parallel on a solid support, as microtiter formats on microtiter plates in robotic assays 
are well known. A preferred embodiment of the methods includes adapting the methods 
to use microtiter plates or pico- nano- or micro-liter arrays. In high throughput assays it 
is desirable to run positive controls to ensure that the components of the assays are 
working properly. 

The high throughput screening methods of the invention include providing a 
combinatorial library containing a large number of compounds (candidate modulator 
compounds) (Borman, S, C & E News, 1999, 70(10), 33-48). Such combinatorial 
chemical libraries can be screened in one or more assays to identify library members 
(particular chemical species or subclasses) that exhibit the ability to modulate the 
function of the CEG polypeptide (Borman, S., supra; Dagani, R. C. & E. News, 1999, 
70(10), 51-60). The compounds, so identified, can serve as lead-compounds or can 
themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical compounds 
generated by using either chemical synthesis or biological synthesis, to combine a 
number of chemical building blocks, such as reagents. For example, a linear 
combinatorial chemical library, such as a polypeptide library, is formed by combining a 
set of chemical building blocks (amino acids) in every possible way for a given 
compound length (i.e., the number of amino acids in a polypeptide compound). Millions 
of chemical compounds can be synthesized through such combinatorial mixing of 
chemical building blocks. 

Preparation and screening of combinatorial chemical libraries is well known to those of skill in 
the art. Such combinatorial chemical libraries include, but are not limited to, peptide libraries 
(see, e.g., U.S. Pat. No. 5,010,175, Furka, Int. J. Pept. Prot Res., 1991, 37:487-493 and 
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Houghton, et al., Nature, 1991, 354, 84-88). Other chemistries for generating chemical 
diversity libraries can also be used. Such chemistries include, but are not limited to, peptoids 
(PCT Publication Np. WO 91/19735); encoded peptides (PCT Publication WO 93/20242); 
random bio : oligomers (PCT Publication No. WO 92/00091); benzodiazepines (U.S. Pat. No. 
5,288,514); diversomers, such as hydantoins, benzodiazepines and dipeptides (Hobbs, et al., 
Proc. Nat. Acad Sci. USA, 1993, 90, 6909-6913); vinylogous polypeptides (Hagihara, et al., J. 
Amer. Chem. Soc. .1992, 114, 6568); nonpeptidal peptidomimetics with 6eta-D-glucose 
scaffolding (Hirschmann, et al., J. Amer. Chem. Soc, 1992, 114, 9217-9218); analogous 
organic syntheses of smallcompound libraries (Chen, et al., J. Amer. Chem. Soc, 1994, 116, 
2661; Armstrong, et al. Acc. Chem. Res., 1996, 29, 123-131); or small organic molecule 
libraries (see, e.g., benzodiazepines, Baum C&E News, 1993, Jan. 18, page 33,); 
oligocarbamates (Cho, et al., Science, 1993, 261, 1303); and/or peptidyl phosphonates 
(Campbell, et al., J. Org. Chem. 1994, 59, 658); nucleic acid libraries (see, Seliger, H et al., 
Nucleosides & Nucleotides, 1997, 16, 703-710); peptide nucleic acid libraries (see, e.g., U.S. 
Pat. No. 5,539,083); antibody libraries (see, e.g., Vaughn, et al., Nature Biotechnology, 1996, 
14(3), 309-314 and PCT/US96/10287); carbohydrate libraries (see, e.g., Liang, et al., Science, 
1996, 274, 1520-1522 and U.S. Pat. No. 5,593,853, Nilsson, UJ, et al., Combinatorial 
Chemistry & High Throughput Screening, 1999 2, 335-352; Schweizer, F; Hindsgaul, O. 
Current Opinion In Chemical Biology, 1999 3, 291-298); isoprenoids (U.S. Pat. No. 
5,569,588); thiazolidinones and metathiazanones (U.S. Pat. No. 5,549,974); pyrrolidines (U.S. 
Pat. Nos. 5,525,735 and 5,519,134); morpholino compounds (U.S. Pat. No. 5,506,337); 
benzodiazepines (U.S. Pat. No. 5,288,5 14); and other similar art. 

Devices for the preparation of combinatorial libraries are commercially available (see, 
e.g., 357 MPS, 390 MPS, Advanced Chem. Tech, Louisville Ky., Symphony, Rainin, 
Woburn, Mass., 433A Applied Biosystems, Foster City, Calif., 9050 Plus, Millipore, 
Bedford, Mass.). In addition, numerous combinatorial libraries are themselves 
commercially available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, 
Tripos, Inc., St. Louis, Mo., ChemStar, Ltd., Moscow, RU, 3D Pharmaceuticals, Exton, 
Pa., Martek Bio sciences, Columbia, Md., etc.). 
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In the high throughput methods of the invention, several thousand different candidate 
compounds can be screened in a relatively short period of time. For example, each well 
of a microtiter plate can be used to run a separate assay against a selected potential 
modulator, or if concentration or incubation time effects are to be observed, every 5-10 
wells can test a single modulator. Thus, a single standard microtiter plate can assay about 
100 (96) modulators. If 1536 well plates are used, then a single plate can easily assay 
from about 100 to about 1500 different compounds. It is possible to assay many different 
plates per day; assay screens for up to about 6,000-20,000, and even up to about 100,000- 
1,000,000 different candidate modulator compounds are possible using the methods of 
the invention. 

The following examples are presented to illustrate the present invention and to assist one of 
ordinary skill in making and using the same. The examples are not intended in any way to 
otherwise limit the scope of the invention. 

EXAMPLE 1 

The following provides a general description of how a list of candidate ceg sequences 
was generated. The list was generated by selecting candidate ceg gene sequences from a 
Concordance web engine using the method described in: Bruccoleri, R.E., Dougherty, 
T.J., Davison, D.B. (1998) "Concordance analysis of microbial genomes" in: Nucleic 
Acids Res 26:4482-4486. 

Microbial Genomics CEG Discovery Process Summary. 
Microbial Concordance Analysis 

The entire genomic sequence data of various bacteria was acquired from several public 
and proprietary sequence database sources, including GTC (Genome Therapeutics 
Corporation), and TIGR (The Institute for Genomic Research). 
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Predicted ORFs from the genomic data were identified, translated, and stored. The 
desirable ORFs were at least 90 amino acid residues in length. Concordance analysis 
was performed among bacteria and various parameters were used to filter out genes with 
high similarity to eukaryotes. 

Concordance Analysis 

The entire genomic sequence of various Eubacteria was acquired from several public and 
private sources. The proprietary PathoGenome System from Genome Therapeutics 
Corporation, Waltham, MA, USA contributed data.. Public data was obtained from 
GenBank (http://ncbi.nlm.nih.gov). The Institute for Genomic Research (TIGR), the 
Yeast Proteome Database, from Proteome, Inc. of Beverly, MA, and the Sanger Center of 
the Medical Research Council of the United Kingdom (http://www.sanger.ac.uk). 
Additionally, the non-microbial sequence data used as a basis for comparison and data 
subtraction was obtained from a proprietary database, including the LifeSeq Database 
from Incyte Pharmaceuticals, Palo Alto, CA. 

Where required, Incyte nucleotide sequences were translated into protein sequences in all 
six possible reading frames. GTC supplied predicted protein sequences with their data. In 
the case of other eubacterial nucleotide sequences, the program CRITTCA (Badger, J. and 
Olsen, G., 1999 "CRTTICA: coding region identification tool invoking comparative 
analysis" in: Molecular Biology and Evolution 16:512-524). The sequences were stored 
in flat files on a Unix computer system. Each predicted amino acid sequence had to be 
greater than 90 amino acids. 

Each predicted protein sequence was compared to every other sequence (an "all-against- 
all" comparison). The program used was FASTA (Pearson, W.R., "Flexible sequence 
similarity searching with the FASTA3 program package." Methods in Molecular Biology 
2000 132:185-219.) The parameters used were ktup=2, and all scores above the default 
cutoff were kept. The output was processed and stored in a PostGres 95 database 
(http://www.postgresql.org) . Graphical user interfaces, using web browser technology, 
were constructed to query the database. 
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A Concordance Analysis was performed on the data. The question used to generate the 
dataset was show all Streptococcus pneumoniae open reading frames with a similarity 
greater than or equal to 30% overall protein sequence identity to both selected gram- 
positive and/or gram-negative bacteria in the database. The data was further required not 
to match yeast or human sequences at greater than 30% overall protein sequence 
similarity. The resulting dataset included a list of more than 400 conserved amino acid 
sequences having known or unknown function. The amino acid sequences having 
unknown functions formed the basis of a list designated Conserved Unknown Reading 
Frames, or CURFs which is a subset of the total list of CEGs (e.g., CURFs includes 
known and unknown). 

The resulting list of conserved genes (e.g., more than 400 sequences) was used as a basis 
for selecting and screening bacterial gene sequences that are essential for ceil viability. 
The Concordance system was designed to permit high-throughput identification of 
conserved gene sequences in the database. (Bruccoleri, R, Dougherty, T, and Davison, D. 
1998 "Concordance analysis of microbial genomes" Nucleic Acids Res. 26:4482-4486.) 

Data Curation And Analysis 

Exact N-terminal and C-terminal translational start sites of genes were identified by 
pairwise similarity searches, multiple sequence alignments. Ribosome binding sites, 
terminators, nearby genes, operons were identified. 

The resulting list of conserved genes was used as a basis for selecting and screening 
bacterial gene sequences that are essential for cell viability. This Concordance system 
was designed to permit high throughput use of the conserved gene sequences contained . 
on the list. A set of Knockout PCR primers were generated, based on the list of 
conserved genes, for the purpose of use in the gene disruption procedure described 
below. The PCR primers were designed to amplify a central 300-500 bp region of the 
ceg (to prevent generation of a functional copy of the ceg gene following integration), 
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ordered electronically, the primers were placed in a 96-well format, and used in the gene 
disruption procedure as described below. 

EXAMPLE 2 

5 

The following provides a description of the procedure to generate recombinant vectors of 
pEVP-3 having inserts of candidate ceg nucleotide sequences. The Knockout primers 
generated by the method described in Example 1 above were used to generate DNA 
fragments comprising candidate ceg sequences. 

10 

Genomic PCR Knockout Target Fragment Generation 

96-well plate format were set up (36 |il H2O , 5 \x\ 10x Vent™ buffer, 1 \x\ gene specific, 
knockout forward primer (0.5 ^ig/|il), 1 ^1 gene specific knockout reverse primer (0.5 
15 Mg/^1), 0.5 \il Vent™ DNA polymerase (2000 U/ml New England Biolabs, Beverly, 
MA), 1.5 |il each dNTPs (lOmM; 6.0 \il total), 0.5 nl S. pneumoniae chromosomal DNA 
(0.5 (ig/^il), 50 \xl total volume/reaction). 

The nucleotide sequences of the forward and reverse knockout primer pairs were 
20 generated from the nucleotide sequence information obtained from the Genomic 
Therapeutics Corporation database for Streptococcus pneumoniae. The primer pairs were 
each used in a PCR reaction to generate a unique internal (e.g., central region) fragment 
of the candidate gene targeted for knockout. 

25 The PCR program was set in the PCR machine (Initial 95 °C - 5 minutes: 30 Cycles of: 
95 °C - 1 minute, 58 °C - 1 minute, 72 °C - 30 seconds; Final, 72 °C - 10 minutes, 4 °C - 
hold indefinitely). 5 \il of each reaction was run on an 0.8% agarose gel after purifying 
fragment over PCR purification kit (Qiagen) to visualize the fragments then ligation 
reactions were performed. 

30 * 
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Ligation Reactions proceeded (set up in. 96-well plate format (10.0 (il genomic PCR 
fragment (generated from step 2 above), 1.0 (il pEPV-3 Smal-cut vector (1:10 dilution of 
vector DNA at 50-100 ng/^1), 1.5 jil 10* ligation buffer (New England Biolabs™), 1.0 nl 
T4 DNA Ligase (New England Biolabs™ 400,000 U/ml), 1.5 \xl ddH 2 0, 15.0 \xl total 
reaction volume). 

Reactions were allowed to incubate in 96-well plate at 14 °C overnight in the PCR 
machine. Transformations into E. coli for in vivo amplification were proceeded the 
following day. 

The nucleotide sequences of the forward and reverse primer pairs used for the polarity 
test were generated in a similar manner, from the nucleotide sequence information 
obtained from the Genomic Therapeutics Corporation database for Streptococcus 
pneumoniae. The primer pairs were each used in a PCR reaction to generate a unique 
fragment of the candidate gene targeted for the polarity test. The fragment generated for 
the polarity test included the entire ceg coding sequence region but lacking the expression 
regulatory sequences. 

Transformation into E. coli (strain LE392): 

The next day, 3 nl of above ligation mix was used per transformation reaction plus 50 \il 
LE392 competent cells. Reactions were set up in 96-well plate format; incubated on ice 
for 30 minutes; heat-shocked at 42° C for 90 seconds; and incubated on ice 2 minutes; 
100 |il SOC media (Gibco BRL) was added; then incubated at 37° C on platform shaker 
for 1 hour; plated on LB/chloramphenicol (13.0 jig/ml) agar plates for constructs over 
night at 37° C with plates inverted and proceeded with colony PCR to confirm constructs. 
The universal primers flanking the insert site in pEVP-3 were used for PCR 
amplification. 

The colony PCR involved the following. 96-well plate format was set up (36.5 jil H 2 0, 
0.5 ^1 pEPV3 forward primer (0.25 ng/nl), 0.5 nl pEPV3 reverse primer (0.25 |ig/jal), 1.5 
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Hi each (6.0 \il total) dNTPs (10 mM), 0.5 nl Vent™ DNA polymerase, 5 ^1 10x Vent™ 
buffer, 1 jil of a 1:50 cell dilution, 50 ^1 total volume). 

pEPV3 forward primer: 5' CATCAAGCTTATCGATACCGTCG 3' (SEQ ID NO:437) 
p EPV3 reverse primer: 5 * CACAGTAGTTCACCACCTTTTCCC 3 9 (SEQ ID NO:43 8) 

Colonies of E. coli LE392 .were picked onto a master plate of LB + 13 ng/ml 
chloramphenicol (incubate throughout the day at 37° C) and then into 50 jal H 2 0 which 
has been placed into a 96-well plate. 1 ^1 of this dilution was used in above PCR reaction 
(if the 96-well dilution plate is kept you will not need to prepare a master plate). Cultures 
for minipreps of plasmid candidates may be prepared directly from the cell dilutions. 

The PCR program was run (95 °C - 5 minutes, 30 Cycles of: 95 °C - 1 minute, 58 °C - 1 
minute, 72 °C - 30 seconds, 72 °C - 1 0 minutes, 4 °C - hold). 

A 10 |il/ reaction was run on a 1.0 % TBE gel. A gel designed for 96 well plates and a 
multichannel pipettor were used to ease loading of the sample rows. The gel was run and 
stained with ethidium bromide. The positive clones were identified with appropriate 
molecular size insert(s), amplified by the flanking pEVP-3 primers. 

Minipreps Of Plasmids To Identify Cells Carrying The Pevp-3 Vector With An Insert 

The constructs that carried an insert were identified. The constructs having an insert 
were inoculated into a 5 ml LB/Cm culture, and incubated over night at 37 °C with 
aeration. Miniprep plasmid DNA was prepared by a standard procedure. The miniprep 
. DNA was digested with appropriate restriction enzymes to confirm the presence of the 
insert (enzymes flank Smal site in pEVP-3) (10 \x\ miniprep DNA, 2 nl 10 x buffer, 1 |il 
Xbal, 1 ]i\ Xhol, 6 ^1 ddH20,* 20 \i\ total volume for digest). 
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To confirm the presence of an insert, the digest reactions were electrophoresed on an 
agarose gel and the gel was stained with ethidium bromide. The positive clones were 
used for the S. pneumoniae KNOCKOUTs procedure. 

The confirmatory PCR reactions, using knock but-specific primers (quality control step) 
involved 35.5 |il HkO, 5 \xl 10 x Vent™ buffer, 1 ^1 knockout forward primer (0.5 lig/|il), 
1 |il knockout reverse primer (0.5 |ig/|il) 3 0.5 |il Vent™ (6.0 \i\ total) DNA Polymerase 
(2000 U/ml), 1.5 \xl each dNTPs (lOmM, 6.0 \xl total), 1.0 |al miniprep DNA from test 
clone, 50 ^1 total reaction volume. The PCR program was as follows: 95 °C for 5 
minutes, 30 Cycles of: 95 6 C for 1 minute, 60 °C for 1 minute, 72 °C for 30 seconds, 72 
°C for 10 minutes, hold at 4 °C. The presence of the correct-sized insert was confirmed 
by agarose gel electrophoresis and ethidium bromide staining. The confirmed clones 
were used for the £ pneumoniae gene KNOCKOUT procedure. Glycerol stocks were 
made of all positive E. coli LE392 constructs and frozen at - 80 degrees C. 

EXAMPLE 3 

The following provides a description of the high throughput gene disruption procedure 
used in S. pneunorhiae strain (e.g., gene knockout procedure). The candidate ceg 
fragments that were generated by the method described in Example 2 were used in the 
gene disruption procedure in order to identify ceg nucleotide sequences that are required 
for cell viability. 

Reactions were set up in a 1.5 ml eppendorf tubes or 96 well plate (1 \xg total of miniprep 
pEVP-3 + insert DNA (usually 10 [il of Qiagen miniprep DNA); then 200 ^1 of S. 
pneumoniae (strain Rx-1) competent cells diluted 1:10 in competence media was added 
(1 ml of competence media = 980 Todd Hgjvitt (Difco Laboratories) with 0.5% yeast 
extract, 20 nl 10% BSA", 1 |il 10 % CaC12, and 0.5 \il (200 |Ig/ml) Csp-1 competence 
peptide). 
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Controls were run with each KNOCKOUT experiment and involved 1 jig pEPV3 Lyt A 
construct = positive control (non-essential), or 1 pg pEPV3 Fts Z construct = negative 
control (essential). Then the 96 well plates and controls were incubated at 37 °C for 2.5 
to 3 hours in 37 °C room without shaking. The 200 pi of the samples were plated on 
Todd Hewitt agar plates with 0.5% yeast extract and 2 jag/ml chloramphenicol. 

The samples were incubate over night at 37 °C in 5% CO2 incubator. Control plates were 
checked for presence of colonies (pEVP-3::lytA) and no growth (pEVP-3::ftsZ). Plates 
were examined for growth (ca. 70-150 colonies) designating nonessentials and zero 
colonies designating essential genes. 

The polarity test was performed in a similar manner, using the polarity fragments, 
described in Example 3. 

EXAMPLE 4 

The following provides a description of the autolysin procedure used to determine that 
the non-essential control samples of S pneumoniae contain a disrupted lyt A gene. 

Phenotvpic Autolysin Test 

The culture plates containing transformants carrying the lytA control vector were flooded 
with 0.1% deoxycholate in H2O. The plates were observed after 5-10 minutes. Plates 
with "ghosts" indicated intact lytA gene, or plates without "ghosts" indicated a disrupted 
lytA gene. The "ghost" phenomenon is due to detergent triggered autolysis of the cells, 
causing a gradual fading of the colonies. 

The detergent treatment triggers the autolysin in lytA intact cells; it cannot trigger the 
autolysin (lytA gene product) in lytA disrupted cells. Colonies with intact lytA "ghost" in 
5-10 minutes due to massive pneumococcal cell lysis. 
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EXAMPLES 

The following provides a description of the procedure used to express the CEG proteins 
(e.g., designated CFE proteins) in E. coli cells. 

CEG Protein Production 

Full-length ceg gene were inserted into pET-21 expression vector using the E. coli BL21 
ADE3 expression system using.the following method: 

For each ceg, custom primers were used to insert N- and C- termini into vectors such that 
the 5' end (N-terminus of the CEG) is positioned properly for expression behind the T7 
promoter and optimally placed with regard to the pET ribosome binding site. The pET 
vectors contain an Ndel site which allows positioning of ATG start site in the vector. In 
cases where the ceg sequence contains an internal Ndel site, blunt ligation of the ceg PCR 
fragment into the vector is accomplished via Klenow fill-in of the Ndel site. In many 
cases, primers were also designed such that the ceg 3' (C-terminus of the expressed 
protein) will contain an in-frame extension of 6X-histidine residues, encoded in the 
vector sequence of pET-21. The individual cegs were PCR amplified via custom 
designed primers as described above. Both ceg PCR andvector DNA were digested with 
appropriate restriction enzymes. The full-length ceg were ligated into the pET 
expression vector. The ligation mixture was transformed into competant E coli BL21 
XDE3 cells and selected for transformants on LB agar with 50 ng/ml ampicillin. Positive 
insert bearing clones were screened via minipreps of the plasmids and size analysis on 
0.8% agarose gels, with detection by ethidium bromide staining, as above. 

Protein Production 

The proper reading frame of each ceg inserted into pET-21 is verified by DNA 
sequencing. 
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A small (2-5 ml) test culture of E. coli BL21 XDE3 with the insert-bearing plasmid is 
tested for protein expression by IPTG induction of the expression vector for 1-2 hours. 
The expression is verified by SDS-Polyacrylamide Gel Electrophoresis analysis of a 
whole cell extract (SDS extract of 0.5-1 ml of cells treated at 100 °C for 5 minutes) to 
determine whether the protein is over-expressed and migrates at the correct predicted 
molecular weight. 

The protein is overproduced and purified, via the following method. A large scale (500- 
1000ml) culture of E. coli is grown to early logarithmic phase in broth (e.g., LB broth) 
and protein expression induced for 2 hours with IPTG (isopropyl-D-thiogalactoside). 
The cells are harvested by centrifugation (8000 X G; 15 minutes) and the cell pellets 
resuspended in 20 ml. of buffer. The cells are lysed by sonication, and the supernatant 
fluid centrifuged at low speed (5000 X G, 15 min.) to remove unbroken cells; The 
supernatant fluid, containing the over-expressed protein is subjected to Ni- NTA affinity 
column chromatography (Quiagen, Inc., Chatsworth, CA). The 6X-histidine residues 
linked at the C-terminal end of the CEG proteins permit rapid protein purification via 
selective binding to a Ni-NTA resin column. The protein-bound. Ni-NTA resin was to 
remove contaminants, and the bound proteins subsequently eluted with imidazole and 
recovered. It is possible to upscale this procedure to larger volumes for higher yields of 
proteins. 

EXAMPLE 6 

The following provides a description of the methods used to purify all 2CEG 
polypeptides (e.g., 2CFE polypeptides #19-117; SEQ ID NOS:349-436) having a 
histidine tag at their G-terminal ends. The 2CEG polypeptides having the his-tags were 
produced by the methods described in Example 5, supra. As an example, results of 
purification of 2CFE 75 polypeptide are presented. 
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Production Of The CFE Polypeptides 

The BL2UDE3 cells harboring recombinant pET-21 vectors carrying a 2CFE nucleotide 
sequence (SEQ ID NOS:244-331) were cultured in LB broth containing ampicillin. 
5 When the A600 reached approximately 0.6, protein production was induced by adding 1 .0 
mM of IPTG, the cells were cultured for an additional 2 hours. The cell pellet was 
collected by centrifiigation, and the collected cell pellet was sonicated in Solution A (50 
mM NaP0 4 ; 300 mM NaCl, pH 8.0). The sonicated cells were centrifiiged at 10,000 
RPM to remove the debris. 

10 

Purification Of The CFE Polypeptide 

The supernatant was diluted with Solution A, loaded onto a Ni-NTA column (Quiagen) 
equilibrated with Solution A; the column bed size was 2.5 x 25 cm, and the flow rate was 

15 approximately 3.0 ml/minute. The 2CFE protein was eluted using a linear gradient of 
imidazole, using 0-250 mM in 450 ml, flow rate approximately 3.0 ml/minute. The 
eluted samples were collected as 22 ml fractions per tube and the eluted samples were 
monitored using spectrophotometry. The amount of protein in the eluted fractions was 
estimated using the Bradford method (Bradford, M. M., 1976 Anal. Biochem. 72:248) and 

20 the samples were run on an SDS-PAGE gel (Novex EC6008) (Figure 3 A). Fractions 
were selected for pooling based on the results of the SDS-PAGE gel. The pooled 
fractions were concentrated using a 10,000 MW Centricon (Amicon) to approximately 5 
ml. 

25 The 2CFE 75 polypeptide, a precipitate formed and was redissolved upon increasing the 
sample volume and removing the imidazole by repeated concentration in 50 mM Tris, 
100 mM NaCl, pH 7.5. Varying amounts of the 2CFE 75 polypeptide were diluted in 
either 20 mM Tris, 20 mM KC1, pH 7.5 or 20 mM Tris, 20 mM MgCl 2 , pH 7.5 at 
• concentrations of 12, 24, or 36 ug/ml. The diluted samples were electrophoresed on an 

30 SDS-PAGE gel under non-reducing conditions (Figure 3 B). The results of Figure 3 B 
suggests that 2CFE 75 forms a multimer. 
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EXAMPLE 7 

The following provides a description of the methods used to purify CEG polypeptides 
5 that lack a histidine tag (e.g., 2CFE polypeptides #1-17; SEQ ID NOS:332-348). As an 
example, the results of purification of CFE 3 polypeptide are presented. 

Purification of the CFE 3 Polypeptide 

10 The 2CFE 3 polypeptide was produced using the large scale IPTG-induced method 
described in Example 5, supra. The 2CFE 3 (SEQ ID NO:334) polypeptide lacks a C- 
terminal histidine tag. The 2CFE 3 polypeptide was purified using a 2-column 
procedure. The 2CFE 3 polypeptide preparation was eluted from a 26/10 Q Sepharose 
column (Pharmacia) using a 0-1.0 M NaCl gradient, 2 ml/minute flow rate, and the 

15 gradient size was 1 liter. Then the 2CFE 3 polypeptide was eluted from a hydroxyapatite 
Bio-gel column (Bio-Rad) using a 5-200 mM potassium phosphate (pH 8.0) gradient, the 
flow rate was 0.3 ml/minute, and the gradient size was 300 ml. A sample of the 2CFE 3 
preparation was run on a polyacrylamide gel (Figure 4). 

20 EXAMPLE 8 

The following provides a description of the size exclusion chromatography methods used 
to estimate the molecular weight and determine whether the CEG polypeptides 
oligomerize. The CFE polypeptide may olimerize to form monomers, dimers, tetramers, 
25 hexameric rings, or other oligomeric forms. 

Size exclusion chromatography was performed on all isolated 2CFE polypeptides #s 1- 
117 (e.g., SEQ ED NOS:332-436). This method was performed using various types of 
columns, depending on the particular 2CFE polypepeptide tested. 

30 
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The Biosil SEC- 125 HPLC Gel Filtration column (BioRad Laboratories, Inc) was used, 
for example, to characterize CFE 8. The mobile phase was 0.2 M KH 2 P0 4 , 0.9% NaCl 
pH 6.8. 

5 The Phenomenex 600 x 7.5 mm Biosep SECS 3000 column was used, for example to 
characterize 2CFE 21 and 39. The mobile phase for size exclusion was 50 mM 
Na 2 HP04, pH 7.0 and 150 mM NaCl run at 1 ml/minute in a Gilson HPLC system, with 
protein detection at 280 nm. 

10 EXAMPLE 9 

The following provides a description of the computer-aided methods used to search for 
similarities between the amino acid sequences of the CEG polypeptides and sequences 
available through public and proprietary databases. In many cases, the function of the 
15 CEG polypeptides was suggested by the results of the similarity searches. The function 
of some of these CEG polypeptides has been confirmed by performing additional 
analyses. Table V provides a list of the suggested and confirmed functions of CEG 
polypeptides designated CFEs #1-117. 

20 The- putative function of the CFE polypeptides were determined using computer-aided 
bioinformatic approaches, including distant homologies, motif searching, or predictions 
based on statistical rules. For example, the distant homology approach involved pairwise or 
multiple sequence alignments, employing tools such as FASTA, and Psi-BLAST. The 
motif searching approach involved using sophisticated hidden Markov models. The 

25 approach based upon predictions of statistical rules involved prediction of transmembrane 
regions, coiled-coil, and other structural motifs. These approaches have been reviewed in 
Computational Methods In Molecular Biology 1998, eds. Salxber, S.L., Searls, D.B. Searls, 
and Kasif, S. , Elsevier, and in Bioinformatics: A Practical Guide To The Analysis Of Genes 
And Proteins 1998 eds Baxevanis, A. D. and Francis Ouellete, B.F. , Wiley-Interscience. 

30 
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Global sequence similarity searches were performed using the amino acid sequences of 
all the conserved essential gene sequences (e.g., CFEs 1-117; SEQ ID NOS:l 14-226) to 
search against a non-redundant protein database using the BLAST2 algorithm (Altschul 
S.F., et al., 1997 Nucleic Acids Res. 25(17):3389-3402). In a similar search, similar 
5 sequences were identified in the Concordance database using the "Neighbor" function 
(Bruccoleri R. E., Dougherty T.J., Davison D.B. 1998 Nucleic Acids Res. 26(19):4482- 
4486). To determine if the predicted amino acid sequences were full length and in the 
proper reading frame, BLAST-type searching and CLUSTAL multiple sequence 
alignments (Higgins D.G., et al., 1996 Methods Enzymol 266:383-402) were used. 

10 Local sequence similarity searches were performed, by searching for Prosite (Hofmann 
K. } et al., 1999 Nucleic Acids Res. 27(1):215-219) and Pfam motifs (Bateman A., et aL, 
2000 Nucleic Acids Res. 28(l):263-266). Additionally, the amino acid sequences of the 
CFEs were analyzed by performing protein threading analyses using the ProCeryon fold 
recognition program (Sippl, et al., 1992 Proteins 13:258-271; Sippl, J. 1993 J. Comp. 

1 5 Aided Mol. Design 7:473-501 ; www.proceryon.com) and Geneformatics. 

In bacteria, many operons include genes encoding different proteins that catalyze discrete 
steps of a common biochemical pathway. Therefore, the operon structures in S. 
pneumoniae was compared with that in other bacteria in order to predict the function of 
20 CFE polypeptides. 

Additionally, analysis of bacterial metabolic pathways were performed using Pathway 
Tools from DoubleTwist, based on the EcoCyc system (Karp P.D., et al., 1999 Nucleic 
Acids Res. 1999 27(l):55-58). This analysis was used to predict which CFEs mediate 
25 various steps of the pathways. 

When the sequence identity between a CFE polypeptide and the annotated database (e.g., 
SwissProt, Genbank) was low (e.g., sequence identity less than about 30%), a Protein 
Threading (e.g., fold recognition) method was used to predict similarities in the folded 
30 protein structure of CFE polypeptides in the absence of a high level of sequence similarity 
with proteins in the databases (review by Teichmann, et al., 1999 Current Opinion in 
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Structural Biology 9:390-399). The Protein Threading method predicts the compatibility of 
a query sequence (e.g., CFE polypeptide sequences) with each of the folds in a library of 
known protein structures. The library of known protein structures as developed, maintained, 
and updated throughout the search process. 

A list of potential structural folds, onto which each query was compatible, was generated for 
all CFE polypeptides (e.g, SEQ ID NOS: 114-226). The fold assignments for each query 
were used to generate pairwise sequence alignments. The pairwise sequence alignments 
were used to generate protein models of the query polypeptide (e.g., CFE polypeptides). 

The pairwise sequence alignments were also used to compare the position of critical 
residues of the structural template with the query polypeptide. The list of critical residues 
was generated by using multiple sequence alignments derived from a structural 
classification of proteins to generate a conservation profile which provided sequence- 
specific positions conserved across a homologous family of protein folds. Comparative 
modeling was used to search the model of the query polypeptide for the critical residues and 
determine whether the structural and functional motifs are conserved in the query protein. 
Conservation of structural and functional motifs permitted assignment of putative structure 
and function to a query polypeptide sequence. 

The Protein Threading method was used to search for putative folded structure and function 
for all CFE polypeptides (SEQ ID NOS:l 14-226). The CFE polypeptides having significant 
sequence identity (e.g., more than 30%) to known proteins were assigned putative functions 
with a high level of confidence, 

EXAMPLE 10 

The following provides a description of the methods used to characterize purified, CFE 
101 polypeptide. The 2CFE ; 101 polypeptide mediates the conversion of pantothenate to 
4' phosphophantothenate, and is predicted to be a pantothenate kinase. 
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Computer-Aided Comparison 

The computer-aided comparison, as described in Example 9 supra, suggests that the 
amino acid sequence of the CFE 101 polypeptide (SEQ ID NO:210) is 42% similar to the 
5 amino acid sequence of the coaA protein of E. coll Thus, CFE 101 may be a 
pantothenate kinase, which mediates the conversion of pantothenate to 4' 
phosphophantothenate (Figure 5). 

Circular Dichroism and Circular Dichroism Thermal Melt Analysis 

10 

Circular dichroism and circular dichroism melt methods were used to determine the 
folded structure of the expressed and isolated 2CFE polypeptides. For example, this 
method was used to characterize the folded structure of isolated 2 CFE 101 (SEQ ED 
NO:421). 

15 

The starting concentration of the 2CFE 101 polypeptide was such that OD205 was 
approximately 1.5, and the OD280 was approximately 0.05 (e.g., 0.05 to 0.1 mg/ml). The 
starting concentration of 2CFE 101 was approximately 344 jiM in 50% glycerol, 50 mM 
Tris, 100 mM NaCl, 5 mM MgCl 2 , 0.5. mM EDTA, at pH 7.5. The polypeptide was 

20 diluted to a final concentration of 7 pM, as determined by absorbance at A 28 o> in 20 mM 
Na-phosphate, 100 mM KG, at pH 7.0. The circular dichroism analysis was performed 
using quartz cuvettes, the instrumentation was from JASCO (Model J-720), the readings 
were performed at 25 degrees C (Figure 6 A). The band width was 1 nm, the sensitivity 
was 20 mdeg, the response was 0.25 seconds, the scan speed was 50 nm/minute, and the 

25 step was 0.5. The circular dichroism thermal melt analysis was performed at a range of 
between 0 and 100 degrees C (Figure 6 B). Additionally, the circular dichroism was 
performed comparing monomer and aggregate pools of 2CFE 101. 
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Size Exclusion Analyses 

Size exclusion chromatography methods were performed using the Biosil SEC column, 
as described in Example 8 supra. The results suggest that the 2CFE 101 polypeptide 
forms monomer (40,200 Da) and oligomers (194,000 Da). The specific activity of the 
monomer and oligomeric forms of 2CFE 101 were determined, as described below. 

Biochemical Assays 

The biochemical assays of the 2CFE 101 polypeptide was based on the PK/LDH coupled 
enzyme assays described by Vallari, D. S., et al. (1987 J. Biol Chem. 262:2468-2471) 
and Song, W. -J., et al., (1994 J. Biol Chem. 269:27051-27058). 

Briefly, the assay was performed as follows. The reaction included: 885 jxl of 0.1 M 
Tris-HCl (pH 7.6), 25 \A NADH (14.1 mM), 20 ^1 ATP (10.7 mM), 50 \il phospho-enol- 
pyruvate (56 mM), 5 ^1 LDH/PK (lactose dehydrogenase/PK; Sigma, catalog # P-0294, 
60 U/ ml PK, 1050 U/ml LDH), 5 fd of the 2CFE 101 polypeptide (9 mg/ml in 50 mM 
Tris-HCl, pH 7.5, 100 mM NaCl which was diluted to 4.5 mg/ml in 50% glycerol). The 
reaction was started by adding 10 |J pantothenate (100 mM; Sigma, catalog # P2250). 
The production of ADP in the reaction was monitored by measuring the absorbance a 340 
nm. The results in Figure 8 show that the, 2CFE 101 polypeptide mediates ADP 
production in the presence of pantothenate and ATP. The K m of pantothenate (n=4) was 
144 (±16.5) |iM, the V max of the 2CFE 101 polypeptide (n=4) was 2.04 (±0.25) yM min 1 
mg" 1 . The. monomer form has a specific activity of approximately 1.7 ^iM min" 1 mg" 1 . 
The oligomeric form has a specific activity of 0.26 |iM min" 1 mg" 1 . 

Alternatively, the 2CFE 101 polypeptide can be tested in an assay that monitors the 
conversion of pantothenate to 4 5 -phosphopantothenate. The same reaction described 
above can be used, except 14 C-labeled pantothenate is used. The. reaction can be 
monitored by measuring the amount of 14 C-labeled 4'-phosphopantothanate produced. 



85 



WO 01/49721 



PCT/US00/356G4 



EXAMPLE 11 

The following provides a description of the methods used to characterize purified, CFE 
39 and CFE 21 polypeptides, carrying a C-terminal histidine 6-tag. The methods include 
helicase reactions, in which synthetic Holliday Junction templates are resolved into 
duplex structures. In one method, helicase reaction was monitored using radiolabeled 
templates. In another method, the helicase assay was adapted for use in a high 
throughput assay employing fluorescence labeled templates. 

Computer- Aided Comparison 

The computer-aided comparison, as described in Example 9 supra, suggests that the CFE 
39 polypeptide (SEQ ID NO: 148) is an RuvA homologue. The comparison also 
suggests that CFE 21 (SEQ ID NO:132) is an RuvB homologue. 

Previous studies by Parsons and others have shown that RuvA and RuvB proteins, in E. 
coli t promote branch migration or movement of Holliday Junctions during genetic 
recombination and DNA repair (Parsons, C. A., et al., 1992 Proc. Natl, Acad. Scl USA 
89:5452-5456; Tsaneva, I. R., et al., 1993 Proc, Natl, Acad Scl USA 90:1315-1319; 
Muller, B., et al., 1993 J. Biol Chem. 268:17179-17184; Mitchell, A. H. and S. C. West 
1996 J. Biol- Chem. 271:19497-19502; Parsons, C. A. and S. C. West 1993 J. Molec. 
Biol 232:397-405; Tsaneva, I. R., et al., 1992 Molec. Gen. Genet. 235:1-10; Mitchell, A. 
H. and S. C. West 1994 J. Molec. Biol 1994 243:208-215). 

Size Exclusion Chromatography 

Size exclusion chromatography was performed on 2CFE 39 (SEQ ID NO:366) and 2CFE 
21 (SEQ* ID NO:350) using the Phenomenex 600 x 7.5 mm Biosep SECS 3000 column, 
as described in Example 8 supra. Protein standards (BioRad) were used to calibrate the 
column, including thyroglobulin (670,000 Da), gamma globulin (158,000 Da), ovalbumin 
(44,00 Da), myoglobin (17,00 Da), and B-12 (1350 Da). 
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The results indicate that 2CFE 39 (RuvA) forms tetramers and 2CFE 21 (RuvB) forms a 
hexameric ring structure. Selected eluted samples were electrophoresed on a 
. polyacrylamide gel (Novagen) (Figure 9). 

The Hollidav Junction Analysis Using Radiolabeled Templates 

The Holliday Junction analysis was performed using radiolabeled, synthetic, 
asymmetrical, Holliday Junction templates, as described in Hiom, K. and S. C. West 
1995 Cell 80:787-793. The Holliday Junction templates were produced by annealing 
together four separate, single-stranded, oligonucleotide strands to form four-stranded 
structures (e.g., the Holliday Junction template). The Holliday Junction templates were 
reacted with the 2CFE 39 and 2CFE 21 polypeptides, in a helicase reaction, to test then- 
ability to generate two duplex structures. 

Producing the Synthetic Hollidav Junction Templates 

The asymmetrical Holliday Junction templates were produced by annealing the following 
oligonucleotide sequences: 

Oligonucleotide strand 1 : 

5'-CCAGTGATCACATACGCTTTGCTAGGACATCTTGATATCAGCCCACGTT 
CACCCGCCTACCAGTGCCACGTTGTATGCCCACGTTGACC-3' (SEQ ID NO:438) 

Oligonucleotide strand 2: 

5 5 -GGGTCAACGTGGGCATACAACGTGGCACTGGTAGGCGGGTGAACGTGGG 
CTGATATCAAGATGTCCATCTGTCCG^TCATCTATGACGT-3 , (SEQ ID NO:439) 

Oligonucleotide strand 3: 

5 ' -AACGTCATAGATGAACGGAC AGATC ATGGTGCTTTTAAAGTCTAGAGAC 
TATCGAGCATTAGTACCAGTATCGAATCCGTCTTGTCAA-3* (SEQ ID NO:440) 
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Oligonucleotide strand 4: 

S'-TTTGACAAGACGGATTCGATACTGGTACTAATGCTCGATAGTCTCTAGAC 
TTTAAAAGCACCATGTAGCAAAGCGTATGTGATCACTG-3' (SEQ ID NO:441) 

Oligonucleotide strand 3 was labeled at the 5' end using approximately 300 ng of 
oligonucleotide strand 3, 1 [il lOx Phosphate Buffer, 5 \i\ 32 P ATP, 1 |il T4 polynuclotide 
kinase (Gibco-BRL)), in a 10 \xl volume, and the reaction was performed at 37 degrees C 
for 30 minutes. The reaction was loaded onto a G50 column to remove the 
unincorporated radiolabel. The final concentration of the radiolabeled oligonucleotide 
strand 3 was approximately 15 ng per jjJ. 

Approximately equimolar amounts of the four oligonucleotide strands were annealed 
(e.g., hybridized). The annealing reaction included: 5 jal Annealing Buffer (200 mM 
Tris-Cl pH 8.0, 100 mM MgCl 2 , 1 M NaCl, 10 mM DTT); 450 ng of radiolabeled 
oligonucleotide strand 3; and 1000 ng each of oligonucleotide strands 1, 2, and 4; in 50 jal 
total reaction volume. The control annealing reaction included: 5 ^il Annealing Buffer, 
60 ng radiolabeled oligonucleotide strand 3; 1000 ng oligonucleotide strand 4; in 50 jal 
total reaction volume. Annealing was performed at 95 degrees C for 5 minutes, 65 
degrees C for 30 minutes, 42 degrees C for 30 minutes, and room temperature (e.g., 
between about 23 to 27 degrees C) for 30 minutes to generate the synthetic Holliday 
Junction templates. The synthetic Holliday Junction templates were gel or column- 
purified to remove the duplex and non-annealed products. As a control, oligonucleotide 
strands 3 and 4 were annealed to form duplex structures. The synthetic Holliday Junction 
templates and duplex structures were stored at -20 degrees C. 

CFE 39 and CFE 21 : The Helicase Reaction Using Radiolabeled Templates 

The helicase reaction was performed to determine whether 2CFE 39 and 2CFE 21 
resolved the synthetic Holliday Junction templates into duplex structures. The helicase 
reaction was performed as follows. A 50 pi total reaction volume included: 25 nl of 2x 
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Reaction Buffer (50 mM Tris-Cl pH8.0, 30 mM MgCl 2j 2 mM ATP); I pi synthetic 
Holliday Junction template (36 ng); 2 pi 2CFE 39 (1 |iM); and 2 \xl 2CFE 21 (1 jjM). 
The reaction was incubated at 37 degrees for 30 minutes. The reaction was stopped by 
adding 5 |il Stop Buffer (100 mM Tris-Cl pH 7.5, 5 mg/ml Proteinase-K, 5% SDS). The 
stopped reaction was returned to 37 degrees C for 5 minutes. The helicase reaction was 
loaded onto and run on a non-denaturing, 12% PAGE, Tris-glycine gel. 

The results shown in Figure 10, lanes 6, 7 and 8, indicate that the 2CFE 39 and 2CFE 21 
polypeptides resolved the synthetic Holliday Junction templates into duplex structures. 

CFE 39: The Helicase Reaction 

It has been previously shown that E. coli RuvA binds to Holliday Junction templates 
(Parsons, C. A., ef al., 1992 Proc. Natl, Acad. Set USA 89:5452-5456). The ability of S. 
pneumoniae CFE 39 to bind to a Holliday Junction template can be tested by employing 
the helicase assay described herein. The results of the helicase assay can be monitored by 
performing a gel shift assay and/or capillary electrophoresis. The presence of a Holliday 
Junction template bound to 2CFE 39, which migrates more slowly than the Holliday 
Junction template alone, would indicate that S. pneumoniae 2CFE 39 binds to Holliday 
Junction templates. 

CFE 39 and CFE 21: Holliday Junction Analysis Using Fluorescent-Labeled Templates 

The helicase reaction described herein was performed using Holliday Junction templates 
having one oligonucleotide strand labeled with a fluorescent agent and another strand 
labeled with a quenching agent. The 5' fluorescent end and the 3' quenching end of the 
strands that make up the Holliday Junction templates are in proximity to each other, 
resulting in a non-fluorescent template. When the Holliday Junction templates are 
resolved into duplex structures, the fluorescent and quench ends are not in proximity to 
each other, resulting in fluorescence. 
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The Holliday Junction templates used to perform this experiment comprised the 
following: the 5' end of oligonucleotide strand 1 was labeled with a fluorescein (e.g., the 
fluorescent agent), and the 3' end of oligonucleotide strand 4 was labeled with DABCYL 
(e.g., the quenching agent). The oligonucleotide strand 1 labeled with fluorescein and the 
5 oligonucleotide strand 4 labeled with DABCYL were custom synthesized (Gibco-BRL 
Life Technologies, Inc.). 

The fluorescein and DABCYL labled oligonucleotides were annealed in a reaction, as 
described above, to generate synthetic Holliday Junction templates. The helicase reaction 
10 was performed as described above. The results of the helicase reaction were monitored 
by measuring the unquenching of the Holliday Junction templates with time (Figure 1 1). 

The helicase assay using Holliday Junction templates labeled with fluorescent-quenching 
agents can be adapted for use in high throughput analyses to test 2CFE 39, 2CFE 21, and 
1 5 other polypeptides for their ability to resolve the templates into duplex structures. 

EXAMPLE 12 

The following provides a description of the methods used to characterize purified, CFE 8 
20 polypeptide, which lacks a histidine tag. The CFE 8 is a putative DNA single-stranded 
binding protein. 

Computer- Aided Comparison 

25 The computer-aided comparison, as described in Example 9 supra, suggests that the CFE 
8 polypeptide (SEQ ID NO: 121) may be a single stand binding protein homologue, such 
asSSB. 
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Size Exclusion Chromatography 

The 2CFE 8 polypeptide (SEQ ID NO:339) was characterized by size exclusion 
chromatography, using the Biosil SEC- 125 HPLC Gel Filtration column as described in 
Example 8 supra. The chromatogram showed one peak corresponding to a molecular 
weight of approximately 89 kDa. Based on the nucleotide sequence, the predicted 
molecular weight of 2CFE 8 is 17,351 Da. In non-denaturing conditions, 2CFE 8 forms a 
multimer. 

Binding Reaction 

The 2CFE 8 polypeptide was reacted with a single-stranded oligonucleotide A. Briefly, 
the binding reaction included: 50 |xM of 2CFE 8 polypeptide, 50 ^iM oligo strand A, 20 
mM Tris/20 mM KC1 pH 7.5. The binding reaction was performed at 37 degrees C, for 2 
hours. 

Oligonucleotide strand A: 

5'-TTAGGGCCCGGGCTATCTTACAATCTCGTT-3' (SEQ ID NO:442) 
Capillary Electrophoresis 

The results of the binding reaction was monitored by capillary electrophoresis, following 
the methods described in "Handbook of Capillary Electrophoresis" 2 nd Edition, 1997, ed. 
J. Landers. 

Separation was performed using an uncoated capillary tube (360 \im o.d., 50 \xm i.d., 
with a 50 cm effective separation length; Watrex International, Inc., Pittsford, NY) and 
50 mM borate pH 9.3 as the mobile phase, at 25 kVolts, 20 minutes separation time. 

The results indicate that 2CFE 8 alone elutes as a sharp peak, indicating little adsorption 
to the uncoated capillary wall (Figure 12 A). The shape of the peak and peak retention 
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time changed with 2CFE 8 in the presence of all oligonucleotides tested (Figure 12 B). 
As a negative control, MurB polypeptide (Pucci, M. J., L. F. Discotto, and T. J. 
Dougherty 1992 "Cloning and Identification of the Escherichia coli murB DNA 
sequence, which encodes UDP-N-acetylenolpyruvoylglucosamine reductase" J, 
Bacteriol 174:1690-1693) was reacted with the same oligonucleotides. MurB reacted 
with or with out the oligonucleotides showed no change in peak shape or retention time. 

After capillary electrophoresis analyses, the 2CFE8 alone and 2CFE plus oligonucleotide 
samples were run on native polyacrylamide gels to determine whether the polypeptide 
was intact. The results indicate that in all cases, 2CFE 8 was intact and had not degraded 
with time or storage. 

Mobility Shift Assays 

The ability of 2CFE 8 polypeptide to bind oligonucleotide strand A was tested in a 
mobility shift assay. 

The results indicate that 2CFE 8 binds single stranded oligonucleotides (Figure 13 A and 
B). In Figure 13 A, the gel was stained with ethidium bromide. The unbound 
oligonucleotides appear near the bottom of the gel, while the bound oligonucleotides 
appear near the middle. The same gel was stained with Coomassie (Figure 13 B), 
revealing that 2CFE 8 polypeptide bound -to the oligonucleotide migrated further than 
unbound 2CFE 8, due to the change in charge carried by the oligonucleotide. Various 
ratios of 2CFE8:oligo were tested. The optimal binding ratio was 2: 1 . 

The Effect of MgCl 2 

The 2CFE 8 polypeptide precipitated in the presence of 5 mM MgCh. The precipitation 
was reversible by the addition of 1 fiM of the oligonucleotides tested. The observation 
indicates specific binding between 2CFE 8 polypeptide and the oligonucleotides tested. 
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Scintillation proximity, assay (SPA) methods can be used in a high throughput screening 
procedure to monitor, for example, a binding reaction. SPA utilizes beads (Amersham) 
which are coated on the surface with a particular compound or molecule. For example, 
the SPA bead may be coated with avidin to facilitate binding with any molecule having a 
biotin tag. 

The binding reaction of the 2CFE 8 polypeptide and the oligonucleotide strand A can be 
monitored using SPA beads and a scintillation counter. The beads can be coated with 
avidin, the 2CFE 8 polypeptide can be tagged with biotin, and the oligonucleotide strand 
A can be radiolabeled. 

EXAMPLE 13 

The following provides a description of the methods used to characterize purified, 2CFE 
3 (SEQ ID NO:334) and 2CFE 86 (SEQ ID NO:409) polypeptides. 

The 2CFE 3 polypeptide catalyzes the conversion of D-glucosamine-6-phosphate to D- 
glucosamine-1 -phosphate, indicating that 2CFE 3 mediates amino-sugar biosynthesis 
through the N-acetyl glucosamine pathway (Figure 14). 

The 2CFE 86 polypeptide catalyzes the conversion of D-glucosamine-1 -phosphate to N- 
acetylglucosamine-1 -phosphate, and the conversion of N-acetylglucosamine-1 -phosphate 
to UDP-N-acetylglucosamine-1 -phosphate, which indicates that 2CFE 86 also mediates 
amino-sugar biosynthesis through the N-acetyl glucosamine pathway (Figure 14). 

Computer-Aided Comparisons Of CFE 3 

The computer-aided comparison, as described in Example 9 supra, suggested that the 
CFE 3 polypeptide (SEQ ID NO:l 16) is a phosphoglucosamine mutase, such as GlmM. 
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Purification of the CFE 3 Polypeptide 

The 2CFE 3 polypeptide was produced using the large scale IPTG-induced method 
described in Example 5, supra. The 2CFE 3 polypeptide lacks a C-terminal histidine tag. 
The 2CFE 3 polypeptide was purified using a 2-column procedure. The 2CFE 3 
polypeptide preparation was eluted from a 26/10 Q Sepharose column (Pharmacia) using 
a 0-1.0 M NaCl gradient, 2 ml/minute flow rate, and the gradient size was 1 liter. Then 
the 2CFE 3 polypeptide was eluted from a hydroxyapatite Bio-gel column (Bio-Rad) 
using a 5-200 mM potassium phosphate (pH 8.0) gradient, the flow rate was 0.3 
ml/minute, and the gradient size was 300 ml. A sample of the 2CFE 3 preparation was 
electrophoresed on an SDS polyacrylamide gel (Figure 4). 

Affinity Capillary Electrophoresis of CFE 3 

Affinity capillary electrophoresis methods were used to determine whether the 2CFE 3 
polypeptide binds to various glucose derivatives. Binding was performed under 
equilibrium conditions, in which the sugars were dissolved in the running buffer and 
reacts with 2CFE 3 during separation in the column. The affinity capillary 
electrophoresis method used to analyze 2CFE 3 follows the methods described in 
"Handbook of Capillary Electrophoresis" 2 nd Edition, 1997, ed. J. Landers. 

Briefly, 2CFE 3 polypeptide was reacted with increasing amounts of various glucose 
derivatives (e.g., substrate) at 25, 30 and 37 degrees C. .The glucose derivatives included 
UDP-glucose, glucose- 1 -phosphate, glucose-6-phosphate, glucosamine- 1 -phosphate, and 
glucosamine-6-phosphate. The reaction included: 2CFE 3 polypeptide (2.0 mg/ml), 
separation buffer (25 mM Tris; 192 mM Glycine, pH 8.0; BupH Tris-Glycine Buffer 
Packs, Pierce). Separation was performed at 25 kVolts, separation time was 15 or 20 
minutes. 

The results shown in Figure 15 A indicate that at 25 degrees C, 2CFE 3 binds to D- 
glucose-1 -phosphate in a dose-dependent manner, as the peak shape and/or the retention 
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time for 2CFE 3 changes in the presence of 100 and 500 D-glucose-1 -phosphate 
compared to unreacted 2CFE 3. 



The results shown in Figure 15 B indicate that at 25 degrees C, 2CFE 3 binds to D- 
5 glucosamine-6-phosphate in a dose-dependent manner, as the peak shape and/or the 
retention time for 2CFE 3 changes in the presence of 100 and 500 jiM D-glucosamine-6- 
phosphate compared to unreacted 2CFE 3. . 

The results shown in Figure 15 C indicate that at 25 degrees C, the 2CFE 3 polypeptide 
1 0 also binds to glucose-6-phosphate. 

A comparison of 2CFE 3 reacted with various glucose derivatives, at 30 degrees C, is 
shown in Figure 1 5 D. The results indicate that D-glucosamine-6-phosphate is a putative 
substrate for 2CFE 3, as this reaction exhibits the greatest change in peak shape and/or 
15 retention time. 

CFE 3: Capillary Electrophoresis and Laser-Induced Fluorescence 

In a further analysis of 2CFE 3 polypeptide, capillary electrophoresis was performed with 
20 laser-induced fluorescence in order to separate and detect interaction between the 
substrate (e.g., D-glucosamine-6-phosphate) and the product (e.g., D-glucosamine-1- 
phosphate) in a one dose, one time-point procedure. 

The 2CFE 3 polypeptide was derivitized by reacting 10 mM FITC (fluorescein 
25 isothiocyanate dissolved in methanol; Calbiochem, San Diego, CA) with D-glucosamine- 
6-phosphate, at ambient temperature, in the dark, overnight. The FITC-derivatized 2CFE 
3 polypeptide (2.0 mg/ml) was reacted with the substrate (D-glucosamine-6-phosphate 
and D-glucosamine- 1 -phosphate) for one hour. 

30 Separation was performed using an uncoated capillary (360 urn o.d., 50 \xm i.d., with a 
50 cm effective separation length) and 50 mM borate (pH 9.3) as the mobile phase. The 
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argon-ion laser had an excitation wavelength of 488 run and an emission filter of 520 nm 
(Beckman, Fullerton, CA). The results shown in Figure 16 indicate that 2CFE 3 binds 
and catalyzes the conversion of D-glucosamine-6-phosphate to D-glucosamine-l- 
phosphate. 

Computer- Aided Comparison Of CFE 86 

The comparison results, as described in Example 9 supra, suggested that the CFE 86 
polypeptide (SEQ ID NO: 195) is an acetyltransferase, such as GlmU which is a 
Afunctional enzyme in E. coli. It has been previously shown that, in E coli, GlmU is a 
Afunctional protein having both the acetyltransferase and uridylyltransferase active sites 
(Mengin-Lecreulx, D. and J. van Heijennort 1994 J. Bacteriol 176:5788-5795; Gehring, 
Al., et al., 1996 Biochemistry 35:579-585). The bifiinctional enzyme catalyzes the 
conversion of D-glucosamine-1 -phosphate to N-acetylglucosamine-1 -phosphate 
(acetyltransferase), and catalyzes the conversion of N-acetylglucosamine-1 -phosphate to 
UDP-N-acetylglucosmine- 1 -phosphate (uridylyltransferase). The Km of the 
acetyltransferase and uridylyltransferase reactions has been previously calculated 
(Mengin-Lecreulx, D. and J. van Heijennort 1994 supra ). Additionally, the crystal 
structure of GlmU from E. coli is known (Brown, K., et al., 1999 EMBO J. 18:4096- 
4107). 

Purification of the CFE 86 Polypeptide 

The 2CFE 86 polypeptide (SEQ ID NO:409) has a C-terminal histidine tag. The 2CFE 
86 polypeptide was produced using the large scale IPTG-induced method described in 
Example 5, supra. The 2CFE 86 polypeptide was purified using the Ni-NTA affinity 
column method described in Example 6, supra. The eluted 2CFE 86 polypeptide was 
dialyzed against 50 mM Tris-Cl, 100 mM NaCl, 25% glycerol, pH 8.0. Samples of the 
purified 2CFE 86 polypeptide were electrophoresed on a polyacrylamide gel (Figure 17). 
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Coupling CFE 3 and CFE 86 to Produce UDPAG 

A biochemical assay was performed, to determine whether 2CFE 3 and 2CFE 86 convert 
D-glucosamine-6-phosphate to UDP-N-acetylglucosamine-1 -phosphate (e.g., UDPAG). 
The 2CFE 3 and 2CFE 86 polypeptides were used in a coupled reaction based on the 
assays described in Jolly, L. P., et al., 1999 Eur. J. Biochem. 262:202-210. 

A time-dependent and dose-dependent assay were performed. Briefly, the assay was 
performed in 96- well plates, each well including 100 (il volume. The assay included: 1 
raM D-glucosamine-6-phosphate (Sigma); 0.7 mM D-glucosamine-l,6-diphosphate 
(Sigma); 1.2 mM acetyl-Coenzyme A (Sigma); and 5 mM uridine-5 '-phosphate (Sigma); 
3 mM MgCl 2 (Sigma); 50 mM Tris-Cl, pH 8.0 (Life Technologies). The reaction was 
started by adding 1 jig of 2CFE 3; and 10 |ag of 2CFE 86. The reaction was performed at 
room temperature. The reaction was stopped at 0, 15, 30, and 65 minutes, by filtering out 
the 2CFE polypeptides. 

The results of the assay was monitored by HPLC (high pressure liquid chromatography) 
using an Optisil IOji SAX column (250 x 4.6 mm), measuring at 262 nm, the mobile 
phase was 150 mM KH2PO4 (pH 3.5), and 1.5 ml/minute flow rate. The results shown in 
Figure 18 show the time-dependent assay and indicate that HPLC detected the presence 
of UDPAG. 

CFE 86: The Uridvlvltransferase Reaction 

The 2CFE 86 polypeptide was tested in a uridylyltransferase reaction, in which N-acetyl- 
D-glucosamine-1 -phosphate and UTP produce UDP-N-acetylglucosamine. The 
uridylyltransferase reaction was monitored using a malachite green/inorganic 
pyrophosphatase assay (e.g., malachite green-IPPAse assay) and/or monitored using 
HPLC. The malachite green-IPPAse assay was used to measure orthophosphate 
production from digestion of the pyrophosphate liberated in the uridylyltransferase 
reaction. 
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The malachite green reagent was prepared as follows. A 0.045 % solution of malachite 
green (Sigma; M9636) was prepared in water. A 4.2 % solution of ammonium 
molybdate (Mallinckrodt) was prepared in 4N HC1. The malachite green and ammonium 
5 molybdate were mixed in a 3:1 ratio, and stirred for about 20 minutes. The mixture was 
filtered, and stored at 4 degrees C. The inorganic pyrophosphatase (Sigma; 1-2267) was 
diluted to 0.1 U/|il in 50 mM Tris/3mM MgCl 2 ph 8.0, and stored at 4 degrees C. 

The uridylyltransferase reaction was performed in 96-well plates. The coupled reaction 
10 described herein was performed, in the presence of 2CFE 3 alone or 2CFE 3 and 2CFE 
86, and included the addition of 0.5 U/well of the diluted inorganic pyrophosphate. The 
reaction was mixed for 5 minutes at room temperature. The reaction was stopped by the 
addition of 240 |al/well of the malachite green reagent and 30 (j.l/well of 34% sodium 
citrate, and the reaction was mixed. The results of the uridylyltransferase reaction was 
15 monitored by spectrophotometry at 660 nm. 

The results of separate uridylyltransferase reactions were monitored by HPLC, using a 
Phenosphere-NEXT CI 8 column (250 x 4.6 mm). The mobile phases included A and B 
as follows: A) methanol/10 mM potassium phosphate pH 6.5 (0:100); and B) 
20 methanol/10 mM potassium phosphate pH 6.5 (40:60). The mobile phases were .run 
under the following conditions: 100% mobile phase A for 5 minutes, to 100% mobile 
phase B in 3 minutes; and hold 100% mobile phase B for 9 minutes. The retention time 
for the UDPAG product is approximately 5.75 to 6.0 minutes. 

25 The results three uridylyltransferase reactions, monitored by HPLC are summarized in 
Table IH below. 
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TABLE HI 



Purified CFE 86: 

2CFE 86-1 
2CFE 86-2 
2CFE 86-3 



Specific Activity 
(nmol/min/ug): 

3.1 
3.4 
3.1 



The results of the uridylyltransferase reactions, monitored by HPLC or HPLC and 
Malachite Green IPPAse assays are summarized in Table IV below. 

TABLE IV 



Reaction: 

Acetvltransferase reaction: 

Glucosamine- 1-P 
Acetyl-coA 

Uridvlvtransferase reaction: 

N-acetylglucosamine- 1 -P 
UTP 



Km (uMY. 



94 
150 



48 
79 



Method: 



HPLC 
HPLC 



HPLC and MG/IPPAse 
HPLC 



EXAMPLE 14 



The following provides a description of the methods used to characterize various 2CFE 
polypeptides, including CFE 21, 34, 35, 39, and 90. The molecular weight of these 2CFE 
polypeptides were analyzed by size exclusion chromatography and gel electrophoresis. 
The 2CFE 34, 35, and 90 polypeptides putatively mediate fatty acid biosynthesis. 
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Computer-Aided Comparison 

The computer-aided comparison, as described in Example 9 supra, suggests that CFE 34 
(SEQ ID NO:143), CFE 35 (SEQ ID NO: 144), and 90 (SEQ ID NO:199) are 
5 polypeptides which mediate a fatty acid biosynthesis pathway (Figure 19) 

The comparison suggests that CFE 34 is a malonyl CoA:ACP transcylase, which 
catalyzes the reaction in which malonyl CoA and acyl carrier protein (ACP) are 
converted to malonyl-ACP and CoA. Thus, the CFE 34 polypeptide may be a homologue 
10 ofK coliFsbD. 

The comparison suggests that CFE 90 is a 3-oxoacyI-ACP synthase II (beta ketoacyl- 
ACP synthase II) which catalyzes the reaction in which malonyl-ACP is converted to 
beta aceto acetyl- ACP. Thus, the CFE 90 polypeptide may be a homologue of E. coli 
15 FabF. 

The comparison suggests that CFE 35 is a 3-oxoacyl-ACP reductase (beta aceto acetyl 
ACP reductase) which catalyzes the reaction in which beta-keto-acetyl-ACP is converted 
to beta-hydroxy-acetyl-ACP. Thus, the CFE 35 polypeptide may be a homologue of E. 
20 co/zFabG. 

Size Exclusion Chromatography 

The estimated molecular' weights of 2CFE 34 (SEQ ED NO:361), 2CFE 35 (SEQ ID 
25 NO:362), and 2CFE 90 (SEQ ID NO:413) were determined using the Biosil SEC-125 
HPLC Gel Filtration column as described in Example 8, supra 

The results suggest that 2CFE 34 polypeptide is a monomelic protein (33,093 Da), 2CFE 
35 is a trimeric protein (25,758 Da; approximately 85%), and 2CFE 90 is a dimeric 
30 protein (43,930 Da). Selected eluted samples of 2CFE 34 were electrophoresed on a 
polyacrylamide gel (Figure 20). 
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Biochemical Assay: CFE 34 

The function of 2CFE 34 was determined by performing various biochemical reactions. 
5 To determine whether 2CFE 34 catalyzes the convertion of malonyl-CoA to malonyl and 
CoA, the following reaction was performed. 

The biochemical reaction was performed in the presence of acyl carrier protein. The 
reaction included the following: 10 |xM 14 C labeled malonyl-CoA, 20 ACP, 30 |iM 

10 2CFE 34 (e.g., FabD) in 20 mM Tris-Cl, pH 8.0 and 5 mM DTT in 300 jil volume. The 
reaction was performed at room temperature (e.g., approximately 24 degrees C) for 30 
minutes. The reaction was terminated with the addition of 45pl of 0.5% TFA. The 
labeled reaction was injected onto a MonoQ 5/5 column on a Gilson HPLC. Detection 
was performed by monitoring the radioactivity of the continuous flow-through of the 

15 HPLC effluent. Chromatography was performed using a buffer gradient for column 
elution. Buffer A included 20 mM Tris-Cl, pH 8.3. Buffer B was the same as Buffer A 
and included 1 M NaCl. The program was held at 90% A, 10% B for 10 minutes 
followed by a linear ramp to a final mix of 50% of each Buffer A and B over 10 minutes. 

20 The substrate (e.g., I4 C malonyl-CoA) eluted at 9.9 minutes, the product (e.g., I4 C 
malonyl-ACP) eluted at 14.3 minutes. The results indicate that CFE 34 catalyzes the 
conversion of malonyl-CoA and acyl carrier protein (ACP) to malonyl-ACP and CoA. 

EXAMPLE 15 

25 

The following provides a description of the methods used to characterize CFE 
polypeptides 40, 41, and 46. 
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Computer-Aided Comparison 

The computer-aided comparison, as described in Example 9 supra, suggests that the CFE 
40 polypeptide (SEQ ID NO: 149) is a phosphomethylpyrimidine (HMP-P) kinase 
5 involved in thiamine biosynthesis. 

* The comparison, as described in Example 9 supra, suggests that the CFE 41 polypeptide 
(SEQ ID NO: 1 50) has a GTP-binding motif and may be a protease. 

10 The comparison, as described in Example 9 supra, suggests that the CFE 46 polypeptide 
(SEQ ID NO: 1 55) has an ATP-binding motif. 

Affinity Purification of CFE 41 

15 The large-scale method described in Example 5 supra (e.g., IPTG-induced protein 
production) was used to prepare a sample of 2CFE 41 polypeptide (SEQ ID NO:368). 
The sample was affinity purified using the Ni-NTA method described in Example 6, 
supra. The eluted fractions were loaded onto and run on a 12% SDS-PAGE gel (No vex) 
(Figure 21). 

20 

Circular Dichroism and Circular Dichroism Thermal Melt Analysis 

Circular dichroism and circular dichroism thermal melt methods were performed using 
JASCO instrumentation. The concentration of the isolated 2CFE 40 (SEQ ID NO:367) 
25 was approximately 21 jiM, in a 0.1 cm pathlength cell at 210 nm. The circular 
dichroism spectrum suggests that this preparation of 2CFE 40 had mixed alpha and beta 
secondary structure. The circular dichroism thermal melt spectrum suggests that 2CFE 
40 has a T m of approximately 67 degrees C. The 2CFE 40 polypeptide precipitates at 
approximately the T m . 
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The concentration of the isolated 2CFE 41 (SEQ ID NO:368) was approximately 70 |iM, 
in a 0.02 cm pathlength cell. The circular dichroism spectrum suggests that this 
preparation of 2CFE 41 had mixed alpha and beta secondary structure, with a greater 
percentage of alpha structures. The circular dichroism thermal melt spectrum suggests 
that 2CFE 41 has a T m of approximately 38 degrees C. The 2CFE 41 polypeptide 
precipitates at approximately the T m . 

The concentration of the isolated 2CFE 46 (SEQ ID NO:373) was approximately 23 jaM, 
in a 0. 1 cm pathlength cell at 280 nm. The circular dichroism spectrum suggests that this 
preparation of 2CFE 46 had mixed alpha and beta secondary structure. The circular 
dichroism thermal melt spectrum suggests that 2CFE 46 is highly stable at elevated 
temperatures. At 90 degrees C, the 2CFE 46 polypeptide exhibited only a 27% loss in 
signal and the polypeptide remained soluble. 

Capillary Electrophoresis 

Capillary electrophoresis was performed on samples of purified 2CFE 40, 41 and 46. 
The electropherograms of 2CFE 40, 41, and 46 are shown in Figure 22. 

EXAMPLE 16 

The following provides a description of methods that can be used to characterize CEG 
polypeptides (e.g., GFE polypeptides). 

Computer- Aided Compilation 

Computer-aided compilation of bacterial metabolic pathways may be analyzed using 
Pathway Tools from Doubletwist, based on the EcoCyc system (Karp P.D., et al., 1999 
Nucleic Acids Res. 1999 27(l):55-58). This analysis may be used to predict which CFEs 
mediate various steps of the pathways. This information may be used in combination 
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with the results of a binding reaction which identifies a ligand or substrate that binds with 
a CFE polypeptide. 

Identifying the Function of a CFE Polypeptide 

The function of a CFE polypeptide may be identified by identifying a ligand or substrate 
which binds with the CFE polypeptide. The ligand or substrate may be identified using 
fractionation and affinity capillary electrophoresis methods. The following method is 
based upon the assumption that the bacterial cell lysate includes the ligand or substrate. 

A bacterial host cells carrying an endogenous (e.g. native) CFE gene or carrying a 
recombinant vector which includes a CFE gene may be cultured so that the CFE 
polypeptide is produced by the cell. The cells may be ruptured in order to obtain the cell 
lysate. The cell lysate may be fractionated using HPLC technology. The HPLC fractions 
may be reacted with a CFE polypeptide in a binding reaction, and the binding reaction 
may be analyzed by affinity capillary electrophoresis methods. The ligand or substrate 
which reacts with the CFE polypeptide may be identified using mass spectrophotometry 
methods (in "Mass Spectrometry" 1990 eds. McCloskey, J. A., in Methods in 
Enzymology volume 193; Henion, J., et al., 1993 "Mass Spectrometric Investigations of 
Drug-Receptor Interactions" Then Drug Monit 15:563-569; Loo, J. A., et al., 1999 
"Application of Mass Spectrometry for Target Identification and Characterization" Med. 
Res : Rev. 19:307-319; Nguyen, D. N., et al., 1995 "Protein Mass Spectometry: 
Applications to Analytical Biotechnology J. Chromatogr.7Q5:2l-45). 

EXAMPLE 17 

The following provides a description of nuclear magnetic resonance (NMR) spectroscopy 
methods that were used to characterize CFE polypeptides. 

High resolution NMR spectroscopy was applied to ,5 N-labled, l3 C/ 15 N-labeled, 
2 H/ I3 C/ I5 N-labeled, and type-specifically isotopically labeled CFE polypeptide samples 
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in the solution state for the following purposes: to assess various aspects of the structural 
state, e.g., foldedness, structural integrity; to refine a previously determined experimental 
structure of a close sequence homologue; to refine a homology-modeled structure; to 
assess the potential for a CFE polypeptide to bind small molecules; and to identify small- 
molecule pharmacophoric fragments that bind specifically to the CFE polypeptide 
(VNuclear Magnetic Resonance" 1994 eds. James, T. L. in Methods in Enzymology 
volume 239). 

The NMR analysis includes screening both a compound deck of approximately 4,500 
commercially available, structurally and chemically diverse compounds (the small- 
molecule pharmacophore deck) and a compound deck of proprietary, known, anti- 
microbial compounds (anti-microbial deck) against the CFE polypeptides (i.e., target 
polypeptides) to determine, either based upon perturbations to the chemical shifts of the 
amide proton and/or nitrogen resonances, as measured from a two-dimensional proton- 
nitrogen heteronuclear single-quantum correlation spectrum (2D screening method), or 
based upon increases in the linewidth of the compound's proton resonance(s), as 
measured by a one-dimensional Ti p spin-lock difference spectrum (ID screening 
method), both whether a compound binds to a CFE polypeptide and, in the case of the 2D 
screening method, where the compound binds on the CFE polypeptide. 

Isotopic Labeling of CFE Polypeptides 

BL21-DE3 E. coli bacteria are transformed with the CFE expression vectors. Expression 
takes place between 20°C and 37°C in minimal media containing [ 15 N] -ammonium 
sulfate as the sole nitrogen source and either glucose, [ 2 H]i3-glucose, or [ l3 C]6-glucose as 
the sole carbon source. Glucose is used for.preparing uniformly 15 N-labeled and 2 H/ I5 N- 
Iabeled CFE polypeptides. [ 2 H]i3-glucose is used for preparing type-specifically ^Re- 
labeled, uniformly 15 N-labeled CFE polypeptides. [ I3 C]6-glucose is used for preparing 
I3 C/ J5 N-labeIed CFE polypeptides. The minimal media is prepared in 100% H 2 0 for 
expressing both uniformly 1 ^-labeled and uniformly I3 C/ ! ^-labeled CFE polypeptides; 
the minimal media is prepared in 95% D2O (deuterium oxide) and 5% H2O for expressing 
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both type-specifically ^^C-labeled, uniformly 1 ^-labeled and just uniformly 2 H/ 15 N- 
labeled CFE polypeptides. In the case of type-specifically W^-labeled, uniformly 15 N- 
labeled CFE polypeptides, 40 mg/L of protonated and uniformly l3 C? } ^-labeled 
isoleucine, valine and leucine amino acids are added to the minimal media. 

5 

NMR Screening 

Compounds in the anti-microbial deck are pre-dissolved to a target concentration of 16 
rnM in deuterated DMSO (dimethylsulfoxide) with each deck well containing only one 
10 compound. Compounds in the small-molecule, pharmacophore deck are pre-dissolved in 
deuterated dmso to a target concentration of 50 mM in groups of 8, i.e., each deck well 
contains 8 unique compounds with each compound at a target concentration of 50 mM. 

3.5 ill of compound is placed at the bottom of a well in a 96-well, screening plate. This 
15 well will be referred to as the compound screening well. Each compound screening well 
contains solution from only one deck well. 166.5 \il of buffer is added to each compound 
screening well. 170 ^1 of a CFE polypeptide solution, initially at a concentration ranging 
from 200-300 jjM, is added to each compound screening well; the contents of that well 
are then thoroughly mixed. The control screening well contains only 3.5 ^1 of deuterated 
20 dmso. The screening plate is then centrifuged in a bucket rotor for 15 minutes at 3,500 
rpm to insure that all particulate matter is at the bottom of the well. 

The 2D screening method requires a single control screening well in which the compound 
solution consists only of deuterated DMSO. The ID screening method requires a control 
25 screening well for each compound screening well. In the case of the ID screening 
method, the control screening well is prepared identically to the compound screening 
well except that the 1 70 jal of a CFE polypeptide solution is replaced by 1 70 |al of buffer. 

The screening plate is covered with aluminum foil and placed onto a rack of a Gilson 
30 liquid handler. The Gilson liquid handler, under computer control by the NMR host/data- 
acquisition software, is responsible for removing each sample from the screening plate, 
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injecting the sample into a high-resolution, *H/ I5 N double-resonance NMR flow-probe, 
removing the sample from the flow-probe, and dispensing it back into the screening plate 
well from which the sample was originally removed. NMR data are collected on the 
sample while the sample resides in the NMR flow-probe. The type of NMR data 
5 collected depends upon whether the 2D or ID screening method is being used. 

Determining Structural Characteristics of a CFE Polypeptide 

In assessing various aspects of the structural state of a CFE polypeptide, NMR was used 
10 to provide the following information. The proton ID spectra and proton-nitrogen 2D 
correlation NMR spectra were used to assess the overall foldedness of a CFE polypeptide 
without actually describing in detail that folded state. Unfolded and substantially 
misfolded proteins produced distinct signatures in these two types of NMR spectra. 

15 The chemical shift of most protein nuclei in either the set {H N , H a , Hp, C\ Qx, Cp, N} or 
the set {H N , C\ C a , Cp, N} for perdeuterated (e.g., 2 H-labeled) proteins were determined 
by procedures well known in the art that involve collecting up to 10 triple-resonance 
NMR data sets. The protein secondary structure was delineated as either helical, turn or 
extended (e.g., p-sheet) by measuring A(5ca - Sep), A8C\ and A8h<x where 8 refers to the 

20 chemical-shift value and A refers to the difference between chemical-shift values 
measured in this protein and those measured for the same residue type in a random-coil 
(unstructured), tetrameric peptide. 

This secondary-structure profile was generated in approximately 2-3 weeks per protein. 
25 The secondary-structure profile was used to confirm the functional identity of a protein. 
It was also used to refine the list of possible functional identities of folds, predicted by 
various computational techniques including fold recognition which is associated with a 
protein or polypeptide. 



107 



WO 01/49721 PCT/USOO/35604 

NMR was used to generate folds of proteins or polypeptides for which both no structure 
was known of a sequence homologue and no structural homologue was discernible in the 
PDB by fold recognition techniques. 

5 Refining a Structural Model 

Nuclear Overhauser (NOE) data were used to refine both homology-modeled structured 
and previously determined experimental structures of close sequence homologues. This 
process took approximately 2-3 weeks per structure. 

The CFE 88 polypeptide was characterized by NMR analysis to establish its secondary 
structure. The NMR data was used to filter the computer-aided threading analysis. The 
NMR-determined secondary structure for CFE 88 suggested that CFE 88 is structurally 
similar to 4-aminoimidazole carboxylase. 

15 

The characteristics of other CFE polypeptides were analyzed by NMR methods. A 
computer-aided threading analysis revealed that the N-terminal domain of the protein 
EGA, which both binds and hydrolyzes GTP, was both structurally similar and 
sufficiently similar in sequence to CFE 52 to suggest that CFE 52 had a similar function. 

20 

The NMR data of CFE 103 suggests that this polypeptide is unfolded. Circular dichroism 
spectra, as a function of temperature, also indicated that CFE 103 was unfolded. 

The CFEs 2, 42, 43, 68 and 88 polypeptides were tested for their ability to bind potential 
25 inhibitor molecules by screening both the anti-microbial deck and the small-molecule, 
pharmacophore deck. CFE 34 was tested for its ability to bind potential inhibitor 
molecules by screening the anti-microbial deck. 
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NMR-based screening was used to measure binding against both the small-molecule, 
pharmacophore deck and the anti-microbial deck. Binding data from these screens 
5 allowed assessment of the propensity of a protein to bind small molecules. The binding 
data was also used to identify sites on the protein which are capable of binding small 
molecules. The binding data was also used to identify common pharmacophores among 
the compounds which bind. 

10 Reverse screening refers to a process whereby known anti-microbial compounds, the 
microbial target of which is unknown, are screened by a general method, e.g., binding as 
assessed by NMR, to find a physical interaction with polypeptide targets previously 
determined to be essential to the bacteria (i.e., the CFEs). The reverse screening method 
was used to determine which CFE polypeptides bind to which compounds in the anti- 

15 microbial deck. The reverse screening method included the following. The compounds 
in a proprietary compound deck were screened for Minimal Inhibitory Concentration 
(e.g., MIC). The compounds exhibiting antimicrobial activity were designated active 
compounds. The CFE polypeptides were screened to determine which polypeptide bind 
to which active compounds. The CFE polypeptides which bound to the active 

20 compound(s) were confirmed, where possible, i.e., in cases where an in-vitro assay was 
possible to construct, as being inhibited in their function as a polypeptide by the active 
compound(s) by examination of the inhibition profile of the compound(s) against the 
CFE polypeptides. For additional confirmation, the effect of the compound on the 
microorganism harboring the CFE polypeptide was monitored (e.g., whole cell assays). 

25 The structure of the active compound was used as a basis to generate chemically-related 
compounds by iterative synthesis. The chemically-related compounds were tested in a 
screening assay for binding with CFE polypeptides. The active compounds and the 
chemically-related compounds of interest were the compounds which exhibited an 
increase in binding affinity for a CFE polypeptide and/or exhibited drug-like properties. 

30 
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The results of the reverse screening are as follows. 127 compounds from the proprietary 
compound deck exhibited anti-microbial activity. 94 of these active compounds were 
selected based upon both lack of cytotoxicity and lack of excessive hydrophobicity. 
These 94 compounds were soluble to 16 mM in deuterated DMSO; these compounds 
5 were also deemed to be sufficiently soluble in aqueous buffer for both the 2D and ID 
NMR screening methods. 

This subset of 94 compounds was used in an NMR-based screen to determine which 
compound binds to which CFE polypeptide. The CFE 42 polypeptide bound two 
10 different compounds with K d 's in the range of 0.2 to 1 mM; the CFE 43 polypeptide 
bound one compound with Kd ~ 30-50 |oM; the CFE 34 polypeptide bound 13 
compounds, one of which inhibited the polypeptide function with IC50 < 10 jiM. 

The enzyme assay used to confirm the NMR results which suggested CFE 34 interaction 
15 with the compounds included the following: 10 jiM I4 C-labeled malonyl CoA; 20 \M 
ACP, 30 pM CFE 34; 20 mM Tris-Cl, pH 8.0; 5 mM DTT; in the presence of absence of 
50 \M of a compound solubilized at 40 mM in 100% DMSO and dilute 100-fold into 
10% DMSO and further diluted 8-fold for a final concentration of 50 ^M in 1.25% 
DMSO. The reaction was performed at room temperature, the reaction was stopped with 
20 the addition of TFA. Two hundred \il of the reaction was injected onto a Mono Q 5/5 
column. The chromatography conditions included: A) 20 mM Tris-Cl, pH 8.3; B) 20 
mM Tris-Cl, pH 8.3, 1 M NaCl. Hold 10% B for 5 minutes, linear gradient from 10% B 
to 50%B in 10 minutes, back to 10% B in 1 minute, hold for 14 minutes to re-equilibrate. 
The reaction substrate ( I4 C- malonyl CoA) eluted at 9.9 minutes, the reaction product 
25 ( 14 C-malonyl ACP) eluted at 14.3 minutes. 
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What is claimed is: 

1. An isolated nucleic acid molecule encoding a polypeptide which is (1) essential 
for the viability of a bacterial cell and (2) has at least any one of the functions of a 
pantothenate kinase, a Holliday Junction branch migration protein, a single 
stranded DNA binding protein, a phosphoglucosamine mutase, an 
acetyltransferase, an uridylyltransferase, a malonyl CoenzymeArACP transcylase, 
a 3-oxoacyl-ACP synthase II, a " 3-oxoacyl-ACP reductase, a 
phosphomethyipyrimidine (HMP-P) kinase, a GTP binding protein, a ATP 
binding protein, or a 4-aminoimidazole carboxylase. 

2. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO: 97 or Figure 115 and wherein the polypeptide is a 
pantothenate kinase. 

3. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:35, Figure 60, SEQ ED NO: 19, or Figure 44,and wherein 
the polypeptide is a Holliday Junction branch migration protein. 

4. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO: 8 or Figure 33 and wherein the polypeptide is a single 
stranded DNA binding protein. 

5. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:3 or Figure 28 and wherein the polypeptide is a 
phosphoglucosamine mutase. 

6. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:82 or Figure 103 and wherein the polypeptide is a 
acetyltransferase. 
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7. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:82 or Figure 103 and wherein the polypeptide is a 
uridylyltransferase. 

8. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:30 or Figure 55 and wherein the polypeptide is a 
malonyl CoenzymeA:ACP transcylase. 

9. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID N0.86 or Figure 107 and wherein the polypeptide is a 3- 
oxoacyl-ACP synthase II. 

10. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:31 or Figure 56 and wherein the polypeptide is a 3- 
oxoacyl-ACP reductase. 

11. The isolated nucleic acid molepule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:36 or Figure 61 and wherein the polypeptide is a' 
phosphomethylpyrimidine (HMP-P) kinase. 

12. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:37, Figure 62, SEQ ID NO:48, or Figure 73, and 
wherein the polypeptide is a GTP binding protein. 

13. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:42 or Figure 67 and wherein the polypeptide is a ATP 
binding protein. 
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14. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:84 or Figure 105 and wherein the polypeptide is a 4- 
aminoimidazole carboxylase. 

15. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:48 or Figure 73 and wherein the polypeptide is a GTP 
binding protein. 

16. An isolated nucleic acid molecule encoding a polypeptide which is essential for 
the viability of a bacterial cell, the nucleic acid molecule comprising a sequence 
shown in any one of SEQ ID NOS:l-l 13. 

17. An isolated nucleic acid molecule encoding a polypeptide which is essential for 
the viability of a bacterial cell, the nucleic acid molecule comprising a sequence 
shown in any one of Figures 26-130. 

18. An isolated nucleic acid molecule encoding any one of a polypeptide designated 
CFE 1-117 having the amino acid sequence shown in SEQ ID NO: 1 14-226. 

19. An isolated nucleic acid molecule comprising a nucleotide sequence which is 
complementary to the nucleotide sequence of claim 1, 16, 17 or 18. 

20. The isolated nucleic acid molecule of claim 1, 16, 17 or 18 which is DNA or 
RNA. 

21. The isolated nucleic acid molecule of claim 20, which is labeled with a detectable 
marker. 

22. The isolated nucleic acid molecule of claim 21, wherein the detectable marker is 
selected from the group consisting of a radioisotope, a fluorescent compound, a 
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bioluminescent compound, a chemiluminescent compound, a metal chelator and 
an enzyme. 



23. A vector comprising the nucleotide sequence of claim 1, 16, 17, or 18. 

24. A host- vector system comprising the vector of claim 23, in a suitable host cell. 

25. The host- vector system of claim 24, wherein the suitable host cell is selected from 
a group consisting of a yeast cell, a plant cell, and an animal cell. 

26. The host- vector system of claim 24, wherein the suitable host cell is selected from 
a group consisting of an Escherichia cell a Bacillus cell, a Pseudomonas cell, a 
Streptococcus cell, and a Streptomyces cell. 

27. An isolated polypeptide which is essential for the viability of a bacterial cell 
comprising the amino acid sequence as shown in any one of SEQ. ID NOS: 1 14- 
226. 

28. An isolated polypeptide which is essential for the viability of a bacterial cell 
encoded by the isolated nucleic acid molecule of claim 1, 16, 17, or 18. 

29. The isolated polypeptide of claim 27 or 28 which is a fusion polypeptide. 

30. A method for producing a polypeptide having the.amino acid sequence of any one 
of SEQ ED NOS: 114-226 or a polypeptide encoded by the polynucleotide 
sequence as shown in any one of Figures 26-130, comprising: 

a) culturing the host- vector system of claim 24 under suitable conditions so as to 
produce the polypeptide; and 

b) recovering the polypeptide so produced. 
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3 1 . A polypeptide produced by the method of claim 30. 

32. A ligand which binds the polypeptide of claim 27 or 28. 

33. The ligand of claim 32 which is an antibody or an immunologically active 
fragment thereof. 

34. The ligand of claim 33, wherein the antibody is a monoclonal antibody. 

35. The ligand of claim 32 which is a diazalactone. 

36. The ligand of claim 35, wherein the diazalactone comprises the structure: 




37. The ligand of claim 32 which is a A^-protected amino acid. 

38. The ligand of claim 37, wherein the iV-protected amino acid . comprises the 
structure: 




O- 



39. The ligand of claim 32 which is an azabicyclodiene. 
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40. The ligand of claim 39, wherein the azabicyclodiene comprises the structure: 




.5 41. The ligand of claim 32 which is an alkaloid. 

42. The ligand of claim 41, wherein the alkaloid comprises the structure: 




10 

43. The ligand of claim 41, wherein the alkaloid comprises the structure: 
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44. The ligand of claim 41, wherein the alkaloid comprises the structure: 




45. The ligand of claim 41, wherein the alkaloid comprises the structure: 
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46. The ligand of claim 41, wherein the alkaloid comprises the structure: 



Ci 




47. A method for detecting the presence of the polypeptide of claim 27 or 28 in a 
sample, comprising contacting the sample with a ligand which binds the 
polypeptide and detecting the binding of the polypeptide with the ligand in the 
sample. 

48. The method of claim 47, wherein the detecting comprises: 

a) contacting the sample with the ligand; and 

b) determining whether a polypeptide-ligand complex is so formed. 

49. The method of claim 47, wherein the sample is a cell, a tissue, or a biological 
fluid. 

50. The method of claim 47, wherein the sample is blood, serum, a swab from nose, a 
swab from ear, or a swab from throat. 

51 . The method of claim 47, wherein the ligand is a diazalactone. 



118 



WO 01/49721 PCT/US00/35604 



52. The method of claim 51, wherein the diazalactone comprises the structure: 




NO r 



53. The method of claim 47, wherein the ligand is a JV-protected amino acid. 

5 

54. The method of claim 53, wherein the //-protected amino acid comprises the 
structure: 




O- 



55. The method of claim 47, wherein the ligand is an azabicyclodiene. 

10 

56. The method of claim 55, wherein the azabicyclodiene comprises the structure: 




57. The ligand of claim 47 which is an alkaloid. 

15 
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58. The ligand of claim 57, wherein the alkaloid comprises the structure: 




5 

59. The ligand of claim 57, wherein the alkaloid comprises the structure: 




60. The ligand of claim 57, wherein the alkaloid comprises the structure: 

10 
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61. The ligand of claim 57, wherein the alkaloid comprises the structure: 




62. The ligand of claim 57, wherein the alkaloid comprises the structure: 




63. A method for detecting the presence of a target nucleic acid molecule as shown in 
any one of SEQ ID NOS:l-113 in a sample, comprising contacting the sample 
with the complementary nucleic acid molecule of claim 19 and detecting the 
binding of the target nucleic acid molecule with the complementary nucleic acid 
molecule in the sample. 
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64. The method of claim 63, wherein the detecting comprises: 

a) contacting the sample with the complementary nucleic acid molecule; and 

b) determining whether a complex comprising the target nucleic acid molecule 
and the complementary nucleic acid molecule is so formed. 

65. The method of claim 63, wherein the sample is a cell, a tissue, or a biological 
fluid. 

66. The method of claim 63, wherein the sample is blood, serum, a swab from nose, a 
swab from ear, or a swab from throat. 

67. A pharmaceutical composition comprising the nucleic acid molecule of claim 1, 
16, 17, or 18. 

68. A pharmaceutical composition comprising the polypeptide of claim 27 or 28. 

69. A pharmaceutical composition comprising the ligand^of claim 32. 

70. A method for determining whether a genomic nucleotide sequence of interest is 
. essential for viability of a bacterial cell, comprising 

a. integrating an exogenous nucleotide sequence into the genomic nucleotide 
sequence of interest, wherein the exogenous nucleotide sequence 
comprises a portion of an open reading frame of the genomic nucleotide 
sequence of interest, and 

b. deteniiining whether the cell having the genomic nucleotide sequence of 
interest so integrated is viable. 

71 . The method of claim 70, wherein the portion of the open reading frame comprises 
about 200 to 500 base pairs in length. 
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72. The method of claim 70, wherein the exogenous nucleotide sequence further 
comprises a nucleotide sequence conferring a selectable phenotype to the cell 
having the genome so integrated. 

5 ' 

73. The method of claim 70, wherein determining comprises selecting the cell having 
the genome so integrated in the presence of a selection agent. 

74. The method of claim 73, wherein the selection agent is chloramphenicol. 

10 

75. A nucleotide sequence of interest which is essential for viability of a bacterial cell 
isolated by the method of claim 70. 

76. A bacterial cell comprising an exogenous nucleotide sequence integrated into the 
1 5 genomic nucleotide sequence of interest, generated by the method of claim 70. 

77. A method for determining whether a genomic nucleotide sequence of interest 
resides within an operon, comprising 

a) integrating an exogenous nucleotide sequence into the genomic nucleotide 
sequence of interest; and 

b) determining whether the cell having the genomic nucleotide sequence of 
interest so integrated is viable, and wherein the exogenous nucleotide 
sequence lacks an expression regulatory sequence. 

25 78. The method , of claim 77, wherein the exogenous nucleotide sequence further 
comprises a nucleotide sequence conferring a selectable phenotype to the cell 
having the genome so integrated. 

79. The method of claim 77, wherein determining comprises selecting the cell having 
30 the genome so integrated in the presence of a selection agent. 



20 
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80. The method of claim 79, wherein the selection agent is chloramphenicol. 

81. A method for inhibiting a function of a CEG polypeptide which is essential for 
5 viability of a bacterial cell, the method comprising contacting the CEG polypeptide 

with the ligand of claim 32 under suitable conditions thereby inhibiting the function 
of the CEG polypeptide. 

82. The method of claim 81, wherein the function of the CEG polypeptide is selected 
10 from a group consisting of a pantothenate kinase, a Holliday Junction branch 

migration protein, a single stranded DNA binding protein, a phosphoglucosamine 
mutase, an acetyltransferase, an uridylyltransferase, a malonyl Coenzyme A :ACP 
transcylase, a 3-oxoacyl-ACP synthase II, a 3-oxoacyl-ACP reductase, a 
phosphomethylpyrimidine (HMP-P) kinase, a GTP binding protein, a ATP 
1 5 binding protein, or a 4-aminoimidazole carboxylase. 

83. The method of claim 81, wherein the CEG polypeptide is selected from a group 
consisting of CFE1 -113. 

20 84. The method of claim 81, wherein the CEG polypeptide is 2CFE 34 shown in 
Figure 55. 

85. The method of claim 81, wherein the CEG polypeptide is 2CFE 43 shown in 
Figure 64. 

25 



124 



WO 01/49721 



PCT/US00/35604 



86. The method of claim 81, wherein the CEG polypeptide is 2CFE 34 shown in 
Figure 55 and the ligand is: 




87. The method of claim 81, wherein the CEG polypeptide is 2CFE 43 shown in 
Figure 64 and the ligand is: 




10 



88. The method of claim 81, wherein the CEG polypeptide is 2CFE 43 shown in 
Figure 64 and the ligand is: 




15 
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89. A method for identifying a ligand in a sample which specifically binds a CEG 
polypeptide, the method comprising: 

a) contacting the CEG polypeptide with the sample under suitable conditions 
so that a complex having the CEG polypeptide and the ligand is formed; 

b) recovering the complex so formed ; and 

c) separating the CEG polypeptide from the ligand in the complex and 
identifying the ligand so separated. 

90. The method of claim 89, wherein the sample is a tissue or biological fluid. 

91. The method of claim 89, wherein the ligand is an azabicyclodiene. 

92. The method of claim 91, wherein the azabicyclodiene comprises the structure: 



93, The method of claim 89, wherein the ligand is a diazalactone. 

94. The method of claim 93, wherein the diazalactone comprises the structure; 



95. The method of claim 89, wherein the ligand is a TV-protected amino acid. 





NO2- 
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96. The method of claim 95, wherein the ^/-protected amino acid comprises the 
structure: 




97. The method of claim 89, wherein the ligand is an alkoloid. 

98. The ligand of claim 97, wherein the alkaloid comprises the structure: 




99. The ligand of claim 97, wherein the alkaloid comprises the structure: 
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100. The ligand of claim 97, wherein the alkaloid comprises the structure: 




101. The ligand of claim 97, wherein the alkaloid comprises the structure: 

5 
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102. The ligand of claim 97, wherein the alkaloid comprises the structure: 



CI 
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&GPwr'koiaoiogue of SE Q :; 10 N0 * 1 '- - 

AT }ATr1rATOCAGGAATTCTrGCC . 

U TAG, LCOTACKjTOATCGACCTATm 

AA VATT 3TA0rTGOTGtTCATWAGACTGGGTrrCTCATG^ 

TTi ITAA KJAACGTATCATCATTACAMOGGIGGTQCTG ACCGGAATACA^ 

GOIAWiATGOTATCGTCTG^ 

AC VCTT IGCATGATICAGGACAATATCXIAACrTGCCCAAAATC 

CO KSTT SATAPTATCGTTGAAAGTACGAATGGTCAATtTATTA 

TC ^AGG \CAAACACCTCAAACATTCCGTTOCA AGGA CTT€A 

GA 3AAC GAAATCTTGACAGATGCATGTAAAATCTTTGTGATCAAAM 

Gf? rdAA rACTCAAATCTGAAGATTACAACCGTAACAGATTTC 

GA HAG 

SCTa'^homologue of SEQ. ID NO . . 2"'. 
AK JGCTAaCGTAATTATTGAAAAAGcTaA^ 
a rAtCCGTGCTGGTCGTtK:CAATXK^ 

ctcttaaocaaatogcttcaAt^ 



ma 



CL LOW TGATttiAAAGACATCGAC^GIGCCrTGAACGCTTGTGA 
Gfr HCT( tTCATTOG CI T G G'rtATCOCAGCrCTr^ 
Aft iAGO rCGGwAAAATGCTAAAGTGGCIX}TCCGCAATATCCX3TOT 
AA< VtAG ^AAAAGCAAAAGAAATCACTOAAGACGAATTOAAGAC 

A& J3j\Q lATGCTGTT^ . 

^■CfllJ "i?omologue ; of SEQ. ID- NO. 3". 
ATC odtfAAATATTriTK^ 

TT/ A&d AGGACGTnTGGAGGCTATGTTCTTAGTCAACATO 
ACC IT0A< lACACGTATTTGACKj<MAAATGCrGG 
CAC QTK ACAAACTTGGTOtCCrTCC^ 
. CCC GtOI CttTGATTItntlCT^ 

CTl CAM CTAGATGATGAAAAAGAAGCAffAAArniAA 
TOdTCC^CntCAGAAGGClTAGGAATnTGGTA^ 
CTJ GOTt ^ACTGOAACTCCTCTrOATCKJAA 

CAC t6cc cgtqaaatcttitk^^ 

AAC A AOCITAATdTTGGTTCAACACATCCA^ 
GC1 ATTG GTnriGCCITTOATGGAaACAGTO^ 
ATC OTTQJ CAAGATIATOTACATOATCGGAAAATACC^ 
TOT 3ACA ACTCTTATGTCT^ 

AGT QGAG TTOGTXlACCGCTACGiTGTTCAAG AAATGAGAA 
CT j 3TCA MTTATCTrGATOGATTACAATACXL\CAOQTGA 

AA T CATG AAGGAAACTGGTAAGAGCTTATeAGAGrreGb^^ . 
AGT f AAt \TCCGAGTGGAAaACGTCATGAAGGAAAAGGCCATX3GAAGTGCGAGCT 
CQA 3AAC ATGGAAOAAGAAATXKK^GGGGAACGGGCGTATCCTTO 
CTT< 50GT nTATGGCAGAAGCOQCTACAA&AGAAGAAGTAGACTACT 
Gift X5TG ^GAAATTGCKUTTGACTAA 

^CF^l"homp2>ogue of SEQ. ID NO* 4" 
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A' fGAA ^AAA^TACTAATTGTAGATGATGAGAAACX^ 
AAQGT rATGAAGTTGTAACTGCTTTTMTGGTM 
TV JTA( TATTCTGGATTTGATGCTTCCAGAAATTG 

Q< ^5T( JTGCCCAncrrATGCTITGAGCCAAAGAT AGTOAATTTGATAAGGmTCGGTTTGGAACTT^ 
G< JAGA X^Cf ATGTAACGAA ACCCTTCTCCAATCGTGA 

TCIt AACCTATGCCAGTAGATGGTCAGGAAGCAGATAGTAAAOCTCAAGGTATC 
A( rAAA rTGTT)XAGACGCCTACGTGGCTAAA^ 
Qi ,GCIT TTOTjATCATIT^ 

G< IGGTT ATG^jCTATTTTGGTGATGTCCGCACAGTrGATGTGACT 
AC IATA( JGCOrkGCCGACCAGAGTATATCTO 

tc}a ; || 



o 



tat 
GAfl 



A^A 

ol ATP 

CO 



2QE5 "homologue of SEQ. ID NO, 5" 
J\l GGAAGAAATTCTCTGTATTGGTroTC 
CC CCCC AGlTOGCAinTCAAAAAGGTTIG 
CC ACT^CAAl^AAATCACAGATGTCCAGTrc^ 
0/ CAG1 GATOCTTTAGTCGTCAATGI^ 
AC G7TI MTClTX^GGCAATGATGIXXriC^ 

1]C rGGT MtGWTTAGCCAGTGGCTCATGAAACGTGCCCATOAAGAAGGXCT^ 
TA ACrt ^GCAGAAAATAAAtATCkX^TTAAGGAAGTCATro 
GC GATf ICTATCTCGTCGCTGTGACC^ 
" CGOC TGAWAGAATGTt^TCACTACTTCACGCTTO 

rGA(P SACG^ATCTTATATIIrACGATAC^ 
CG 3GCiV AAAACCTCA AGTATGTCAGTCCTAA^ 
GC *AA£ CCTATTTTTAGGTCGTTTGGGACGCTTT^ 
TT njrt iATAATGAACTCAAACTCCATCOTAGCAAGCTTGA^ 
TG jGAACTCTtCTGACACCACCAAATAG^^ 

:CAT rAAAGATAAOACAGACCTAGTCATTTCAGGC 



a:i 



GCC }TGTXXiGCACCAGAAGGCGTCGCCGTCGTCaCAGGAAAAGCAA 



GAC 
TGGJ4CTP 



homologue 6 f SEQ. ID NO, 6" - 
ATfTA^tXXAGATGATAGTTTGACATTGCACACGGACTTO 
ACfAA<5j^TTCACAATMGAAGGCGGTXJm 

' TTTGCAGGTTTGGAAAGAATTGTGAACTATCT 
CrtATTTtaGAQTCGCTTGC 

Qt GGT CTGGCCAAGAAGGG&ATTTGGTTTTTGCTAATGAArc 
CQ( !AAT( iTCAGtrGGTOGAAAGGGCTCTITrG 
AQ< rrOC} ATTCXJTTCGGTTATbGAAGATGAACXXTTC 
OA' XICQ< JCCAlxnGGGGAAC^CGC^GCTGTGATTGa^ 
GCT AAG' nTCTTXGACATTCCTGTTrra 

TG; AGC TTCXAGGcnTACGCTGOGACCCACAAAAATlG 1C 1 i UTGOATAOCTATQACAOOCTTC 

GG> TCG<fKnACCACKnGCC^ 
^7 GAT TGA^t TCTffiGGATATTOC^ 

GAQGCft AGATTTATGCTTCTAATGATTTGGACGAA 

J TGTdrcGQGCGTGGGTACCAAQCTGA 
CA/lGATI GfTGCAATOGAAGATOAAAGTGGTt^ 
AA/ AGfC rrCOACGCCAGGTAAGAAGCAOGTGTGGCGCATTACCAGTCG 
OTC ACT/ CATCACTTATGATGGTOTGGATATTAGCGACA^ 
Ta]C ATAC ATPAiAGAAOACGGrrOOTAATTTrGATGCX^ 
~ ^TjTAOT rrACAACTTOCCTAGTTTG^CTOA 

Ojl ATAAGOTItnGCTC^TXXGCAGCACTATC 



'homologue of SEQ, 
jCTATTA 



TTG TOA' 



ID- NO. *7" 



k ^AT]PCAATGGTlTC<aGGTX^CATC 
rrTTGTGACGATmAGTAGATGCACGCTro 
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V 

. V 



XTjrQnbdTOATAMCCAAAACrCTm^Tm 

A/ TGG< GTCAWATTTTGAATCAC^GGMTC^ ■ 
A/ aAOI TOTAACAGATGCGGCCAAQAAGCTCATGGCTGATAAOATTGCTCXlCC^b 
TC AGAT raAAAOHTOGGTACiCATG* 

7T 3GCI XrrAAAAAGATTCCTGTTGTTGGAAAC^GCCAGGGGTC^ 
AC CAAl AAAQATCTGGAAATCTTGGATA(^€ajGGGATTCnCT 
Cfi CTTA \GTltKKlATXC4ACTt3GACKn , ATCAAGGATCAGTTG 



<W AGCC CCTGTGATTATTATAGATATXJACCCGCXjCCCTCGGTTTCCGTG 
TC rQTT< !GTGAAGaAAGTCCX}TGATGGCAAACTCGGTAACTATACOT 



CO 



2 CPfc8 "homologue of SEQ. ID NO, 8" 
AT ViTtkACAATGTTQTACnCH'AGGGCGTATGACAtt 

ta jCag rroc0AciTn'ACTCTr^ 

ATX AATOT^TTATGTGGCGCCAACAGGCTGAAA^ 
AttG0<5 JTOACAGOTCGTATCCAGACTCGTAGTTACGAfA^ 

jGTC 3TCK3CTGAGAATTTCCAAATOTTCGAAAGC 
TK rrCT( ICAOWACTGCAAACTATO^GCACCTA 

ca> rrrooAGcJUcAAACC^nro 

2.CF£?"homoibgu^ ofi SEQ. ID NO- 9" ''■..*. 
ArOAAAlCCCTC&TATTACAGAATTATTGAAGA 
CT( >ATCj< rTOATtrnSGGAGGGGCTGTTTXX^ 

OJl IAG4 4GTTGTCAAGG<XZAATATT^TAAAATCAAATCATTC . 

AflyCTC TATCJCCCTTTCIVMAAGA 

Aft 3AQ<p VOTAAATCXlAAGCAAGIATATCGtAAC^ 

C<7 'A|(nj( rCOT<MTTAGCfAAACGCAIGGAAAAAATC 

OCT 'GGG JGGCATATCGG1AAATTAACAACGATGACCTTM 

CjX \TTA? TGCT«!AaaAGGAATrGCGGATCGTGAAGCnG 

TXtfACA<IGKK3<^CACGarnOT . 
{KiAAGGOATATrGACACrAOGATr^ 
OACTAdAOATrrroAACTCGCTOAAAAAGATGC^ 




<nTfTQAACAAATGGGAGCAGOTGC^ 
ATC IGC&teTCAAATCGCAGGGCT^^ 

a<^gaoccgctAaoaaaattcaagaagaac^c^ 

gCHflO "homologue of SEQ. ID Nbl 10" - 
ATC ATCdATATTt^GGAATCAAA 



\^ on 



GG; AGT^ lAGCCU.GGATACCATTGTnjCTATGAAAAATO 
TTT ECTtp lATACCCAbTrATG0GAOGTQTTCTOA TTAltK ?AA 
OT1 ATCfl AAACCTGAAAAtAAAGGAAAACTGGTCTnTACGCT 
GO AGTl GTAGGAGGCGACCAATXGGTTATGAGAGCGAC^ 
GTI OAAC GAAAGGCTGAAGTGOAIXIGCAAGGTTGCAGCCAGTO 



AA 



ZCFmW'homologue of-.5E<5. ID »NO. TJ"- 

AT^ftAjATGAAATTfA 

VXCATATtXHTATCOTTCCCAACTAC^ 
WSATCCCAAGATTTTC^ 




A*G ZAQA GTGATCMGAArrCTATOAAAACTACATC^ 
GGT PTATl JAGA ^GTTTQTTTCTtn'CCCTAAAGATTO 

GCA 3<?CA TTAC ^G AGTTTGTCAGTXFnKSGCATGCACGCtATGA^ - 
AGG 3GG/ GCGC jfVTCGCCGTtATTGGAGATGGAAGTTTA 
m^CC^AAGCAGAQATTCTO 
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TATATTACGGATAATATTCCTGAAGATrTG 
^BGaCC^GCTATTAA^ 

^ TA /U^CAC^GCSCTTrAAAACAGTGTrTAA 



cipjEll "homalogue of 

aactt> 



OTaactactt^ 



TO JAAqTAO 

ta;ga 



cerA 



AA 3a)aT MjSnATCCAAATOGGC^ 

r£ Sea jIatSaTCaaa^agcaccttatccatttagcctatc^ : 



Do 



^ Ail 



^ AA 



SEQ. ID NO A 12" 



CTC iCTTATACTCOCATtJCTG<JAATrrCGTQAACTGC^ 
TG^CtJa 



^ JffeR ^tqaaTACCTACGGGOACAATGGAAACATCCTCAT^^ 
A® :cl5 : CTGAeCC^ 



A'XJTGACCGTTGACATCGl ... . „ 

TCdGTG<iTGGTCAAGACTTrGAACAAAGTATCATTGeAGACGACCTACCrc 

AC; ACTACAKKAAAAGC^GtGTAaTTCTGGCT 

G^^CAGGA^ 

( Sari WATtoGtfraACAiSuus^ 

^ AA® K3CGTAGCTflOCTCICIQATC^ 
CGV AGA/ AAGGTCGCTOAAGGGGITCATTATAAG^ 

TCI OGTA ATG^fcAATCiPGGCTTATCGCCT^^ . 

CTC ACTT rTCTlAA 

2CF1I14 ■ ~ •' • . • 

ADC AATdTAAA^OAAAATAGADAACTraTTTnCGAOAAGITGCA^ 

ACM GTGC TTCtiGlCTCJtnCATTGC&GTTATC 

-^^i^xcAj-xc^aTaAAAATroTGTAGATAAGTT^^ 

SGCAiTEOATTQOTACCTTOCAAAOACOTAAOOTCAAA ' 
■TTGGACTCAGTAAAGCTAGCAGGGGAAATTCAAAAAAa 
F^GTAAATATTTCTAAAGAAGAAAGCAAACACaGTTTTTCGAGAGAGG 
iTAG&CnTAGCCAGACTAGATAAGATTOAATATCTTaam 

-- .GTTGAAAQAGATTTTCAAGGCGGGCCAAOATTTACAAAGAGAAATTCA^ 



GCTAOGIGTK 
COAOATGtOAi 

AitrvmrijcA' 



AGT3TTTXJT 



ft^TTG 



GCXVGCAGTG, 



CAA VTTCCAAA' 

TCGiTTCi 



rGAECAGTTGAAAGAGATTTTCAAGGCXjGeCCAAOA 
UKATATGCCTATGACCGAGTTAAGTATGGGAAtGAGTC 
i iACTTrTTGTTCGTATAGGTACATCAT^ 
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•0 



i i . . 

4C\<EAS "homologue of SEQ- ID NO. 15!', ■ . 

A iGGGtaTTGCTCTAGAA^ 

GGaTG TTCTTTGACGATTGMGATGGCTCTrATACAGCrmAATrcGGCA 
?t TAT TI*CAACTCTTAAATCiGTTTATO 

O GACWCTAAAAATAAAGATATTCGTCAAATTAGAAAACAGGTTGGeTrGG 
q JftJ rCA^TTTITGAAGAAAaKim 

7j JATGCTGTOAAGACTGCGCGTGAGAAACTCGCTCTGGTTGGAATTGATC^ 
?/ GTQ GTrTOAGCTGTGAGGGGGACAAATGAGACGTGTTGCCATrG 
CT TC TAWTTAG'TCrTAGATGAGCC^ 
^ TC S^C^CCAGTCAGCm^ 

Tf CTA/ ■tcaaGTOTATGTAATGGAAAAGGGACGTTT^ 

5 ACQ- 4gTTT^ATGGAAGAAGTTGAGTTGGGAGTA(XTAAAATTA 

$ riS .GGOaTXnXJATTTAAACGATr^ 

e CI E16. 'homologue. of SEQ. ID. NO. 16" ' . ■_ 

xi ndAllATTCl&ATTTTTAGGAACGGGGGCTGGTCAGCC 

TCiUtjtfTCTTBGATC 

£aXu:CACA^CGACC*C<^^ 

CC TtTG -CAOTrnWTTTCTAGCTOTGCCTT^ 

OA CCTC \AGGAATCAAGTCATTliGTCTTAACCAGCCTKXl 
^ O\m0^TOA0TTTCACCAAGATTCTCritKJCT 
^ OG AGC1 GG AGCACACTATTTTCTCTGTTGGCTATC 
<M. GC TGAA AAACTCAAGGCTGCtGGTOTTGCGTTCXKSGOC 
LT TTi ITTtf X30AAGA0GGAACTOAaATCAAGGCAGCAGACTATATCT(^GCG 

CA rTAT rTTAGGAGACACTCGAAAAACGGGTGCCAGTGTGCGTCTO 

Ck rGAG rcCACITATCGCAAGGGTOATGAA AAAATTCCTCGTAA 

CfiX^^GTAGCGOTAOAAGC^CKnGCC^ 

AA (VAte/ TATTAGCAAACTCAAGAAAGACGCTWCAGAATTrrTGAAAATO 
OA KG AA GTGGAAATCTAG 

^GPIS!? "homplogue of SEQ. ID NO. .17" . 
ATI 3AGTjC\TA]TCAGTrTA^ 

TT( IGAG. VGTCCATTTTTGTfTTCAATOTAGGGTTAAAA 

GG' [GAT •CCAMCATGGATTACCTrTTTGAAAATAGCGAC^GTATTO^ 

AK !CflG iTGCCATTGGTOCTCTACCT 

AO VVTift ^GT*3GCAAAGCTCrrrGTC^^ 

XTC ATGv iGAATACGGAGATtOATTTTGOTGGGACAGTGG'l J ICCTICI lOXTAC^CTTAClCCGria^ 

OX( JAOT rrGGdAATTGTCTTOAAGACAT^GQAAGGAAGCA 

AA iCOG rTAGTGAATCTTATG<XACTGATTTTGCTCGTTTGQCAGAGATTCM}TC 

TCh CCTC \GTGATTCCjGCGAATGCAGACAGCAATATrCAGGTGGCTAGT^ 

ATI AGCC AAACTATTGCnOACTQCIOAAGGTCOTATCATCOCT 

TG> GCA( ATTITTGACG<!TGCG<JATAAAACAGGTCOACOTATCGTC^ 
cvk Af< GTCC QCACAGCGATTCGTCTTAAGAAGtTGTCTrTAGCCAACGAAATT^^ 
^ AT/ TGTC TCGcVrTGAAGACCATGAGTTGATTATTCTTO^ 
<m ACI TG01 AAGATGTCGATTGGTCGCCATCGTTATGTAGAAA 
Cp"- ACC GGTC CGTCTAfTGCTAAAQAAGCCrrtOTTGCOGGTO 
^* TTG FQAA ATTGaTTACCCaAAGTTTACATGTATCAGGGCACG 

CA> TCTT nTGC^CCtAAGTACCTCTTPCCCraTCCAAGGGGAG^^ 

CKS MC|viCK3c]£am^^ 

TOC AQACTTrGTTCCAQCKXJATCQQTTTCA 
fit jGAMtGTTGTTCTTCXjTOAC 

!AA CCCTtGTGAGAAGAAAATTGTGGCTAGAGCTCGTGTTt^ 
gaWgag' CGCGATATTCTCCGTCAAAaTTCAGAATTC^ 
AOA TGAC ITTGACTGGGCAGATCTCAAAGGTAAGGTTCGTGACAATCTG 
ACfc \Abfc GTCGIeCCaGCCaTTTTACCAG^ 
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i (CFfel9)i ,l homologue.of„5Ep J ..ID. ^O.. 1.8" 
AT bXcfi AM^AATTICAtCATGT^ 
AA GGf / TCTACGTTGATtKJOACrnXIGGCGOAGCAGOA 
AA SMK iCCATCTCTATGCCTTTGACCAGGA 
TA 3 ATt 3AGMGGGAATGGTGAC 



CCmGACCAGGATCAGAAT< 

XCTTTATCAAXiGATA&CTTCCX}!^ 

rrrfcffATAAAMOGATGCGCCACTGGA 



4C3JGC 
jn TATGAAff 



if 



CtOTAAfU 
AGCAG/ 



CArrcA 



uuTGG^AATCATTATOACTATC&TC 
\GAGATTCCGCGTAAGATTOAGCAAGCGCG7$ 
AG&AG/ QATTATCAAGtTGGTCAAAGCTGCCAAGGAACT^ 
5GCTArrcGAATTGAAGTCAATOATGAACTG(^ 
nXTOGATOGTAGAATTICAGTGATTAQ^ 
JAAGCTTOVACAGTTQAAGTTCC^AAAQGCTTC 

XfcAGTrtaGCGtGGTGAG^ 



'TCFtel "homologue of SEQ. ID 4 NO. 1<9" J 
AP }AGf KGAATmAGATAATGAGATAATGGGGGATCAGGAGTrAGT^ 

TA TTA( JGTOAATATATCGOACA&aA 
TOX3G0 UXUftGeGCTGOAltfcTO^^ 
XG ; TAD QCCAACGAACTOGQAGTCaAIOT 
GT< blA< ICTATTITOAATGAGTTAGAGOCTGGGGATGTjXn^ 
QVi IAGK K»AAbAqQTG<ntTATA<nXCT^ 
QG XGT IGTAQtOTTCATntSGAGITA 
CT< ICAA" XXGQTACXiGGCACGTnTGGGATrACAGGCCA 
GX, iATT< iTpG^GOGGACGGCSAGATATTrrTQAGATOG 
U*-- TA< «JrA< ITCGTGGGAGCCCTCGTATTGCCAATCGXCI^^ 
GG< JAXt }GG01TAATltUTGATATTATTACCOATAAGGCTTTO 
T0< >ACTV LTGTGGATC^AAAATarrTC 

Ai( rrCTI TC7TOTOAATATCXK2CGAAGAGOT . 

aLlGOTTTATCATOCGGACACGGTC^ 

AAVACWiTGAAAAAGO<teCCGCACTCT^ 

. 2CW24 "homologue of SEQ. ID. NO. .20" 1 ' 
ATC AGTHTGTTTTTaGAT^ 
TTC Of CC TO AXAAATAIXjTCCCT^ 

CGTT 5TAGACGAAGGACTACGTACCITGATC 
GtC AAA> AGGQATGACC^AAGGGATGCATGGTCGTG^ 
; G1A CQAC TGTT£ttTGATGCGGAGACTGGC^ 
CGI TGCC CACC^TGGTCGTGG7X}GACGTGGAAATATIX^m 

ATC rcrc ivaaatggagaaccacotcaggaacotoa gtm . 
u ggStta^aggattoocatctctagggaag^^ 

^ TOCXGaiAa^CTITAC^ 

K CACTAGCCGACTIXKX^GGTT^^ 

\JT TGA 2A7C aAGCGTACACGTGTTATCCTTCACATCATTGATATO 
AGC ATTA CCTaGGTATCAATAAAGAGCTGGAC^ 
TGT \ACT lAtAAGATCGACATaCCT(^GAGTCAGGAAAA 
AAA TTAT IaTOAATTTGAAGaGTTACCAGCTATCT^^ 
ACATOrAGATGCTACAGCTGAATTC^ 

TGG \AG^ AGAAGTTTACTATGGATTTGaCGAAGAAGAAAAaGCCT 
CQA 2ATG CKiTAjpTITClWTG AAAAACTCATGAAA 
GTC tfT^fcATTTGCCCGTCAGCT^ 
ATG X3GA TTTGGTCCC&ATTGGTAAAT^ 
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AT3AACTAI 



^ TA 



i "bpmotogye- qJL.seq. id ..no... -2;" ; - 

ACftTAATOTTGGGAAAATCGTTAATACGCAGGGATTAC^GGGT^^ 
TGkCGCUTTTTGCAGAA 

G rCCA UCAQTCACCATCGCTAGCCACCGTAAACAG^ 
A n CA+ VTCA^TACTATCGAAAAGTACAAGGGATACAGTCT^ 
CT Iq AC 3ATCQTOMTITrACT^ 

'ATCaaCGAAATCCTGCaaC^^ 
« rTTTACCTTATATCCCACCAGTGGT^ 

2CFfc» ^'homologue of SEQ.ID NO'. 22" 
AT 3AAC^TTGATATTTTAACCCTCTTTC 

TO 1AGA /VAAA'GGGCTCTTGGATATCCAGTATCATAATTTrC^ 
Q\ TGAT jAGCflXTACGGAGGCGGTCAGGGCATGTTGCTC^ 
CTi ^TTGi |AAAQAAAAATCCGCGCGTTATTCT(XTXX}ATCCT 
Tft tAfJA rTTGpCIX^AAGAGGAAGAGCTAATCITrAT 
AA 3ACf rTGGTAACAGATGAGATTrCCCTAGGCGACTATGTTCTCACrc 
C& ITGA rTGATCCTACAGTTC<KXn , dArtCCAGAAGTO 
TT TTCI TCAGGTCrrrTAGAATATCCTCAGTACAGACXjTCCCTATC 
AF< 5TAt laATOAGWKHt^CCATGAAAA^ 
CO. K3CG ^AGGCCGGATTTACTTGAACATTATCAACTGa 



it* 



TG 

:CT< 

.ATCCi 



CAi kAO* UACAMGMGCGGCCG<^Cr<^^ 



"hdmologue of SEQ. IP NO. 23" 
AT< *ATl<|lAAGCAAGTAAATTAAAAGC^^ 

i< iAAG* rrAGTCACCAGAAACCAGGTAMGGAAACACGATCATCCGTATG 
<JGTO TACATTTGACACAAGCTACCGTCCAGAGGAAAAATtTGAACAAG 
litACTlWTACAAAATGGATGACA^GCATAGTTCATGAATACAGAA^ 
TAGT£AaTGTTGAAAaCGAATTGCTTIACATO 
AWctGMGTGATCGGTGTCACCGTTCCTACT 
AK AAAOGTOOT^CTO^ . 
CAC lAtr t^TGGAAG<^GOACAAAAACTCGTTATCAACAeTGCA 

coaQc^<:c^c<^ccaccaccactga . 



^nbs^'homoiogue of SEQ * 10 N0 - 2 - 4 " 

- ATC GCA'iTTG/lAAOTTTAACAOAACG 
CTC AATC TCATjGTCCAAGXGGCAACGAAAGAAATTCGCre 
TGI TC7TA AAC^CmATCAAGAAAGm 

CC1 GCGC AACAGATTATTAAAATCGTTGATGAGGAACTGACAGrc 

TTA IX*AA GTCA^AAGATiaiAACCATW 

TGC TGGI AMTTGGCGAACAAACTCAAGAAAGAAGAAAATGC^ 

TAT CGTC tUGClGCCATTOACCAGCI^^ 

A AC AGA> lGT A<^GCfGTTGAbAtlX7M 

GTC TTQA rTOATACTOCQGGTCGTra 

GAT TtyOC FCAACCAAAlXIAAATCTTGGTroTCGTTO 

GCdTGAqrrTAATCCT^ 

GGT 0CT6 CTCftJTXTCTTCGTCACArTACTO 

CGG ACAl TGAAACCTTCCACCGAGACCGCATGTCrAGG 

GAT TGAC AAAGtrTCTCAGGAATACGAT^^ 

CAJC Ztft< }Attt rAATGATTTCATCXlAT^ 

CT 1 IGATTCCAGGTATGGCCAACAATCCAGGCC^ 

gc r :gta vacgtgccattctctcttcgato^ 

GCC jT£G XGTCGTATfG<nX3CTGGrrCTGGAAATACAT^ 

AAC ZkGC CTAAACAGCTCATGCAGGGTGTTATGTCTGGGGATATGAATAAAATO 

ATT UTC OAAAjrAACCTTCCT^ 
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1 r< iATG SGaCAAGGCGGTATWCTGACTTATCAGCTCICGGAGGAGCAGGAATGCCAGATATGAGC^GA 
TOTTTOGTGGCGOTnGAAAGGTAAAATTGGTGAATTrc 



'2 Cfe *1 



. TO 1 
AO 



GT/G 

accac, 



2CEEJ2, 



V$i' V PAAA^rGAApAAAGCGAAGAAGAAACGC^ 

£C ?ES9 homqlogue of SEQ. JsD NO. 25" 
A XTTAtCTTATTGAAATmAA^ 

r CCA< TA^GOTCACTTGATnTAGCAGAGGAGmATCCM 
T< !CAT< itTTi^TCTCGTGATTCACkrrTGGroCT 
C( rrm AAAGfcGACTMGGACAAACACXJAAGrrrc^ 
a ACT}] TGCCrnTACTTGGTGTCtTTAAATrro 
0, r GCTC TCATbTTGATTATCTACX3GGGTTXjCCTTC 
to O2 iGCCl lAGTGTAAC^GAGTTGGACAAGOTCCTnrATACGACCGC 

f< ClfCI mACCAGGGACTAGCCGTTX^GGTGCAACGATTC 
rjT^ GTTt TGAGAGAATTTACCTTCTATCTTGGGAT^ 
^ £ ATTtlGTGAAAGCCGGAGAACTCTTGAGCnTTGGGCAATIGTn^ 
CI nfrA ^GrCACKIATGGTCGCTATTCGCTTCTTC 
<k rTAAU lTaCGGTATCGTGCTTCGTAGTGTTTTCCTACTT^ 

C/ CCAC CACCACGACTGA 

1 . • ■ 

• 1 

2CBB30' "hoiologue of SEQ. ID NO. 26" " 

GWTTaTTTOACCGTCTATTCGGAAAAAAAGAAGAACCTAAAATC 
A/ AATCtrGATTTGTCTGAAGATGTTGATC 

A^GCAjC aggttgaaattgttgaacaagctgtgttccaa 

?ATTTAGAGCCAGTTGTAGAAGTTrcrrG^ . 
GAUTAC TGAGIrrrCTAGAGACTATAGAAGAAAATAATICTGAAG 
AG AAG/ AACGGTTCAGGAAAAATATGACCGCAOTGrrAAGAAAACTCGT^^ 
OA AJOC (nTCTTTGCTAAGTTCCGCTCTGTTGACGAAGAA 
TG VCJTG ^TGTTGGTOIXXIAAGTCGCTTCTAACrt 

XAA lAAACCTGATXKIACTTCGTCGTGTC^ 
AG 2TA€ lATGAAAGCATCCACTtCCAAGATAAinTa^ 
I \AAA CAACnTCTATCGGAAAACtACX3CC^CCGCTACA 
J SCAdJ fcTACtlTCCGTG(X}GGTGCAGTAGCTG^ 

AG rAAcrcGAOCTGAAAAAacra^ 

Gf TATCi JAT ATTCTCATGATTOATACTGCTGGT^ 
AA VAGA ITGGlCGTATTATGAAACXlTGTTOTCkXA 
Tp JlC^i xGTcAA^TCCCCra^ 

TO iCTi^ LGATTOATGGAACTGCTCGAGGAGGTOTtXSTTCTAGC 
AA LATIp 5ATTGGTTTTGGTGAAAAAATCGATGATATTXX}AGAGTTT 
CTC AAGCmTAATCGCGGCCGCACTC^ 



homblogue of SEQ. IP NO. 27" 
TATATTGAAATGGTAGATGAaACTGG^ 
TG^AATr TtK^GCCCAAAAATTAOTAAAAGAAGACMGC^ 
IGAACTTAATCTGOAGTACCGTGACACCGACC^ 
^TTOGAAATTXJCCTTTGACGAAGAC^ATrTC 
iTOqCTATATTGGGGAAtTOTTCATCTC^^ 

^GCGTOAGAtGGGGTTCTTGGCAGTACACGGCTrTTrAC^ 
kGAAOAAGCGGAGATGTTCGGTTTACAAGAAGAAATTTrGA 
i jC^CCACXACGACCACGACTGA 



gactt 

<31t ACAfSCTTTj 
TATACTqCGG; 
AG/ 



homologue' of SEQ,. > ID NO. 28" 



V?. CTp^CCkGACTiXJOAC^^ . 
* GTTn^USGA^^ 
^ TQCpf (jjSAAA^^^^ 
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X 



J>CFe3t. 
illaMiAAATrGCAGAtK^^ 

TC CCTI GGGTGCTOTCTTACTCATGC^nTGCG ACGCAGQCTGCCAAATATTTaa^AATXJTGGAqATTA 

tl GAG< raMCAltlACAAGA^LA^ 

ft GAG< iTTCGAGAGTCC^TTCAGCAAGGOjCAGCAGATG^ 

m q aC] rrOATG(3TATGCGCATCCACTCAGlTCGlTTGa:ACK3CTTGGTAGC^ 

GC ICAA' taUMQAGWUKXaTOACCaca^^ 

GMWAMGAAGTTGTCAAGCGTCATGAGCTrcTC^ 

CCACj^CCACTGA 



"homologue of SEQ ID. NO, -29" i 

A! ^C^UCAMCAAGATTTGATCaCTAAAGTAOCAG AAQCTA ^ 
& GC AGCy GTroAAGCTGTATTTGCAGCAGTAGCTQACTATCTTGCAGCrcGTGA « 
•2 £ STAACTTTGAAGTTCGTGAGCGTGCAGAACGTAAAGGTCGCAACCCACAAACTGG^ at 

U: OA CAAlTaCAQCrrrtAAAOTAOCAOC^^ 
(^CCACCACX?ACCAOCACTOA 

» ' 

2GBS2W "homoloaub- of SEQ. ID' NO. 30" 
XT ^CT^AACAGOCTTTrTATTTGCrGGTCA^ G 
AT Mil ^TCCGATTGTrAAAGAAACGATroATCGAGCGAGTCAGGTGCTAGGXTA T 



.1.1 



CCtrATCtrACCQTTTATTGCAAC^AAA.GQGCT TA 

CT TOO nT^TCGCAAGCGGCGCCTTCGAT C 

IX iATG }AAQ&GCGGCTCCXGCTQACZC^ 2A 

TO JAAG \AQCpTCTCAAAAAOGTItn^ A 

AK CGGT JATTGCTOGAGAAGlXkiTTGCAQITG u\ 
<X CG nTOATTCCTCTTAAGGTGTCAM 

Vfc TU rCTGGCTCAGGTAAGnriTITCAGATTl^ A 
Ttti ZAAA UG^GACATTGCTtUGCTCTTO^ 

TO IGGT ^TGQAAGAAGCAGgCATAACklAACTTTATCOAGATTO HT 

Or AAA ufcAAJTOATCAAACTOCTCACrTAGCro G- 
AiJlAAC rCGAJ^CC^CCACC^CXIACCACTGA 

' "hofaologue o£. SEQ4* ID NO... 3ft* 35 
ACTAGAAlCATAAAAATATCTrTATTA 
1< ri^G»GGAGCqAAt^ 

"~ TGGTATXiAAGGTCGTTCGCATTTC^GG 
. . p ~~ rr~ r - ^GCTAXTGCAG^CTOGGTTC^GTw * * r/Y ™ 

at> cifei tatoctcaaoatga^gaagcagAt 

^ TjA/ TATC A(^C AATCAGTCTTGA^CiCC£ATqA , 

GTJ GTTC GTPPATGPGGAA^A' 
rk GTC] GTGGGAGGCG AGGTC 



GTC] GTGGGACGCGAGGT0GCTAG1 




f ATflTtTAG^ . — „ 

teACCAOCaCCMCACCAi 



2CFeIw i "homologue of. SEQ. ID NO.. 32" 
ATG GGAj4TOAAAAAGAAA£TAAAGTTGACT^ 3 
COA CTAA TGOCJpTAACTAGCGATAtTACAGCOGAATCGGCTGA T 
GCf GAA/ TC^TTCGCTTTTTATCG^ T 
^ COntaCTCCTXHTG^^ 

GQA rTAA QGGGSpTTCGAGAACAATATCCAGGTCGAGA \ 
TiGCjTAAAGTAmAAAGAAATO 
v^GGTrATT TGGCCCTGrrCX^GCCCTATCAAiCU 
^ ACCITOGrAGTGTGOAtAC^ 

TTO X&A ICAi^GCTITaTCTGAGCGAAATGGC^ 
TTT TAf< TTTGCAGTTCATGCGCCAGGTCGAGTCGCCCT^ 
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T|TGCA|VACCyATTTCnTC 

A UjAI rTGOAMATAGAAAAAGAAAAGCCAAGAAAAAGGCTCAQAAAAOT 

c^a:cactga ' ' 

2ctE37^homologue of SEQ. ID NO. 33" 
A' ^AbATTAGTAAGAGGCACITATTAAATT^ 
r GAT 'GTGGtrCTATTCGACCAOCAGTGCTATnT 
A \ACC VAGGi4AlCITrTGGATIGGTAGTrrGATACTO 
T( iAGA WTGAGCGACTAATCATmAGmT^ 

Q( }TAT TCAGTAAAGOGGGCATACGGT7X30ATTTCGGTTGCAGGAGTAAGTATTC 
TJ lAAA ^TCApATTATTKKnATTTAGCTCACCGATTCT 
n TTG/ AGTT ITGACTtJAAAATCAATGGCTr(XCCG 
a ittqt 1AAG TTGGGAArnTCCCTOATTTAGGAAATC 
A' ACA( riTA(fo6AATCGCTTATCG 
TC TTG/ CCACTATCAGCCTAATCGGTCPTTC 
V* C< CTTI A.GTGCCTrTTTTAATCCTTTTGCCX}ATCGTGC^ 



CCA 
ACCi 
Ala 



2CEB3S 



^ ■ 



GCCATC GTCAATGGTGGrTCGTTTCGTCTA^ 
U X3CTC ATACAOACTTTGTCTTTTCTATCGTOATTQAAGA^ 
CTfCtCir rGi 1 1 riCATGATTTTGC^GATTATCTTG 

" { rcaG^TCGGAGGGATGATGTTCGrrCAGGTATTTC 

7 ^CAGCSAGTAACCITCCX:cnnncnTATCCCAG 
Cn rGTXTnAAATATTXJATGCCAGfGAAAAACQGGCTAAGTTO 
AAjC CTlt^TTOAAGCTCGAGCAOCACCA 

ho mQlo. gpjs . of . SEQ . ID, NO , 34" 
ATjOCTC^AAlTrrAACCTTTATTCT 
CT ITQC AAGAAATCAGGGAlTrTAGTACGTCAATTTGGCA 
TT 3GCA VOGAlXJGAAC^jQCCTATACCATTCGAATCTTtiOCT 
GGjOTOA TGATACAACTGAAATCAAGAC^GGAACGCCTGlTAGTtTOAC 
VOGC ATCAATC TCTCA GGTAAAAAATTC 
i AK6 ^CAAGCTCTITATCAAAGGATTGGTTCTGG^ 
AAtXKTI rOTtyjLAAQCAGATQQTACTOAGGTTCGGATTGCAC 
Afl 7TQG SG CA^C TGATTACGMTTTTGCAGGTCCT 
QG mTi ATt^^ATGCAGGGTCGTOTCAGAGATGTTC 
CCfTCG^GMCtoTA 



<^G<r 3AAA GGTTGATCCAAGCTGTGGAAACAGAAACCAAAGATAAGATO 
— XTG> AAAGGCK3AG7GACAaACAAGTX^CTGTTAG^ 



TXT. TGV XKQQKKK^^ 
CTC TCSGC AATTCtCTCAGGTCTGAAAAATGTGATTTTC 
g^t-atg: TTAAGGCAAGTAQTGATCCTOCTAAAAATGGAATTOA 

tti GCA1 caatattgggatttttaaTcttattcc^ 



Gix aio tggttgtctitoatgattgctg 



CC4CCA( OICGACCACTGA 

£CVJ&39 . honiologue of SEQ. ID KO^ 35" 
ATCTACdCAt " * 



TTG 3TT A TATC* 



ATTTAAAAGGAATCATTACCAAAATTACrGCCA^ 
. :<rraCATGlX3G<X^ATCCTTATGCCT^ 
Git CATC AGGir]rGTG<XJTGAGOACCKX^ 

MTTjlAAGOC^TTOAAACCAAGAA^ 
AAC AGCC CACKIAGATPGTOCTGGACTTOGAAGGC^ 
GGT CQCA GTGCAACX^GTGCTGAAAACCMGAATKMA^ 
CTA 2AAC CKlU^OAGCrCAAGAAAATX^ 

TAT :AAG IXXJGpCCn'AAAATGTTGGTCAAACTCGAGk^ . 
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GCjACI 
ODAAA' 



2 CM 143 



to 

^ A 



AAA' 
AG/ A 



GO 



#B4G "homologue of SEQ. . ID; NO. 36" . - 

A' OAA^AArAATCGfATTTTAGCAGTTTCTQQAAATQA 
G< ICTA( ICTATJACCTTGAACGGCTTGCAT^ 
GC iATTl GA AQtrcmCCAACTGATGATACCAT^^ 
TC GGCH FAATrAAGATTGGTCttCTCGCTACTGTCAGTGTC 
CC CCO GGA4ttACCTGTGGT<mGGATCCTO 
A( CXClGCXJAAGAGtTGATTCGC^ 
n \TT$ rcCGfrK^GGAAArTAAAACCTTGGAAG 
G/ GCQ( ICAGCAGTCATTATCAAGGGAGGCAATCGTCTTAGTC 
TC GAC> GACuTTTACTATCCTAGAAAATCCAGTTATC 

GC ctcj agcaotgcc^gtc^cttc^ 

TC 3TTt UtXHtKHATra 
TC 3AGC ACCA|tCACCACCAGCACTGA 

lOFMl i "homologue of ~ SEQ, ID NO. 37" 
XT 3ATTtoAAA!^GAGAAAAAAGAGGAGCGAGTX:CrGCTC 
TT rqAO ^CHyCATGGAAGAATTGGCTAGTTT^ 
QA CAA> AACQtGAAAAATAtaATTCCAAGACCrrCGTC^ 
GG XdQA TGCA/QAaGAAATX^CTACTGTCAT^ 
0|a GGA> GTTCTCGGTGTTAAGGTC^TTTM^ 

"iCCA tgaagggaagctccaagtccacctagccgaactcaaat^ 

fATl ^TGCjrCAOCCGTCAGGCAGGGGGAATTGGTTCCC^ 
AAfcCGT DGTA 3CGTTCGCAATCLAAATCACGGATATCGAGCGC 
UCnTGAQAGAAAAA^^ 

1 CAACTATCATGAACATCTTGAC^ 
GG^TGC jACAACCAAQAGTATTCATCTGGGAGGCXACCTCCAACT 
ArpCAA UTTracCOACAGAGTTCCn^ 

G' TCATCTTATCGATGCnAGCAATCCTTACCACGA 
AaJ\GAC CTGGACATGGAAaAfATTCCTCACTTGACGCm 
'At X3CAAACGOCATATACCCT^TTTCrrXK3CAAG 
* XIATAAGATTAAGGAAATTTTTO 
AT< jATTTAGAGAGTGTTGCAATTCTGGAAGAACGTGATTATCAGOA^ 
GQtACA TTCGOAGAAAAATAAATGGAGGTTAGAAGAATITrATO 
CTOA 



"homologue. of SEQj ID NO. 38". 
AT$G<^GAAAAAACaTATXOTATGACCOT 

TTG<rrrCGTCGACCAGAAGTGGTAGAACtt^^^ 
GTGj iGTA^GAAGCAGCTAAGGATGAACAAGCCTTTCTCGAAGG 



TCA CCAT CCAAGAAATTGGTGAGGACGMGAAGAAGTTTATATTATC^ 

cn rocA ggtaaggtttcaaatcaaagcxjcaattc 

AG< AAC< ATTOAAACGCCTGTTGGTAGCTATGATGT^^ 



CCAX CACCACCACXACTGA 



OcSdo- "homdOo^ueJ-of .£Ep. ID NO- 39" 
ATG ACCAA^TTjACnTCTAGGCTTC 
TTA IXrtt 3ATT<jfATCAACTAGCGAAGAAAGA(^^ 
CCT \GCA TtX7TTTTTCCTAAATCk>AGAA 
GGA AAA( ICAGTTCATGCtmTTAACm 
TCT[^(ElTGGAAGTTQ<KjAAAAm^ 
Vp <Ht lATt ITTCAACATATAGGAACTGAGGTCTT^ 
GOT fcTGT 3AGTjWTTCATCAlpTnTGAGf AACHT^ 
TGA 2AAA GTTG iCGAntnttTAAACTAGTATTTACAAGAG 
TAApGG^ CTCG ^GCAGGACCACCACCACCAGTU A 
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O.CKE44 "homologue of SEQ .. ID NO,. 4,0." 
A,' XjATtTTAATTACAGGGGCAMTGCCCAATTAGGMCGGAACT^ 
ki idbk rACGT^QAOTAGATGTGGCTCAGATGGACATTACCGATGCAGAA^ 
KJ IGXQ STGAAAiXGACTnAGTCTACCACTGTCCA^ 
G^kAAA lAGTTOGACTTCGCeATCAATGTGACGGGGACAAAAAATGTCG^ 
6l GCAli LCTCTAGmATATTrCTAaJGACTATGTCTTreACGG 

A< ITTG 1 / .tgaccqaccagatccacagacagmtatggacgcactaagcgtatggggqaagagttagttga 
q; AGG itgtgtctaatttctatattatccgtactc 

ti acca tgcaaaatcttgcgaaaactc^taagactttaacagttgtaaatgaccagtacggtcgtccgac 

TTGGA< TCOTACCTTGGCTGAGTTCAT^ 

TG TOA> atgatgcgacagaagacvvcaacatggtatgattttgcagttgaaattttgaaagatacagatgt 

CC AAG 1 CAAQCCAOTAGATTCCAG/TCAATTtCCAGCCAAAGCTAAA 

ci ggcc aaagccaaagctactggatttgttattccaacttggcaagat^ 

■ AA G AAj< ITXjAGACIXXJAGC^CCACCACCACCACCACtGA 

2CSE45 ' "homologue of SEQ. . ID . NO. 41" 
AT G WCGTTjCTCTCGACTCTAGAGTW 

or ggtc gctatctatatagccgttagtcatgattatcccaataatat^ 

CG COTC GATTGCCTreGGGCTTGTGATTGGm 
TG ACC£ txrTTTCTATATATTrrACKiCTTGGKSACTTATGATCr^ 
XT SCAT ;AACGGGTKXXttAAAACTGGGTAT<^TAAA 
GA AGA3 ATCCTATATCCTCATGTT<KJCT(XjTGTCATTO 
CG SACCGTTCCGCTGGACTITnX^ 
GC \CTT >AAGT0ACTraGGGACGGCrnX3GTTrrrGtA 
Gp rrrij TGGAAAATrATTATCCCAGTATnGTGAC^^ 
Tfe OTA< iCAAGGACGGACGAGCTTTTCTTCACC&GATTGGAATGCCGACG^ 
,-^» aa^TTG SOX^TCCCTTroAOTTTCCCX^AACA^ 
^> CG jAK3i (SGTCGCnATTTTCTCAGaGA^ 

GA nTt ACGGTrATTGCAGAAGATTTTGaCmATrGGCTCT 

GA rrr^XXJTATGTCGAAGATTACTCTr^ 

TTy iTGA' X3TTOCTCtKXL\CATCITTGAGAATATCG 

cc< nte< ctttcatttcgcaagckkkjatcagctattatc^ 

AT< )AbT rACGfpGACTAATCTAGCTGAAGAAAAGAGCGGAAAAGTC 
TTV AAA* JAAAITAAACTCGAGCACCACCAC^CCACtACTGA 

■ ' . . . * 

£C¥]!4C "homologue of SEQ. ID NO. 42" 
AfCKKSAJvAAATCATCGGAATt^ 
AG< JAAG 3CTTTCAAGCAGTXX3ATGCCGACGCAGT^ 
TGJ .GGG' XrTAGTACAGCACTTTGGGC^GAAATCATTCTTC 
GC1 'AGTC TCATGTTTTCAAATtXTGAAGAGCAAAAATGGTCTAATCAAATTCAA 
AG( iMC IGGCTACnTrGAGAGMCAGTTGGCTCAGACAGAAGAGATTT^ 
TTl K)AGCAGGViCTACAGCGATTGGmGCTCAGACTTGGTTGGTCT 
GAl .GGC TAArcAAAAGGGACCAGTTGltXJAAAGATGAAGCnXjAGTCT 
f AC AAAi AMGAAAGATTTGGCCLVGCCAGGrrCTTGATAATAA 
AA( TGO TATCCTTCTTGAGGGAGGTAGG<^GAT(UCAGAGATCTCGAGC^ 



I' 1 



"homologue of SEQ* ID NO. 43" 
ATClAGA^AAAtTOrrATO^ 

3t rGCCCTAAtTCXJAGCTATTATCTTGGCTGAfG^ATGTC 
CGdAljG? AGC^kGTCTTGTGO A AATG ATGGAATTGATGGGA G CTACTGTTAAGCGTTATGACGaTOTATT 
GGA OATt GACCCAAGAGGTOTlX^AMTATTCCAAt GCCITATGGTAAAAtTAACAGTCTTCGTGC^TCT 
TACTATt TTAT^GOAGCCrCTTAGGC<XJTTTTOGTOMGCGA 

i rCCTi fttCOGATrcACTrACACCtTMGGCG 
GAT K ACA rcAAGTTATCrGCflAMGATACAGGACnTC . 
TGGpAGC^CG^TtMTACGAtGATTGCTO 
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O X»TO UVCCTQAGATTATTGATGrAGCTACTXJTCTTGAA 

a n'iU r atgatcattattgatggto 
r GAA k^g^mcatatatatctitagctgctgcagttggtaaa 
g^acA :ctgc3aagggttottgctaagttggmgaaatoqgagtg^ 
tti gtcc^ggaacagtctaatttgaaagcaatcmtattaa 
>t tgc^c^ccgcttacccctctm 
a laaacgtgtaaatcatgttntgaactagcaaagato 
.-^ ^t a tti tgtajcacgggtggacgtoamacgtggggccagtgttaaagcgaccgaot 

^ TC fCXC VIGTwTTGCTGGGC^AKKjCTOAAGGCAAAACT^ 

Gi iTTA' TCTGATATTATCGAAAaATTACGTAATTTACK?AGCGGATATTAGA 

^(X^QIACCAdCACCACTGA . 



Co G' 



law 



O 



'horaologue of SEQ. ID .NO. 44" 
A.^1TC^GAATTGAAtTTrc 

T1 TGAJ TGATAAAGTAGCATCTTATCATATCGATATTATGGATGGCCATTTTO 
CI CCTT jGTT£ATTGAAGAAGTTCaaAAAATTAGTG 
a: AAC< TmCK^TAGATCAAGTTCTCOAmAC^ 
1C GTCI rGCTtTTCGTrrOATtGAf AAAATTCATC^ 
A/ ACA< !CTGTTTCTACAATXnTrcCCTACATTGATT^ 

TTTT 3CAGGACAACGCTrTTTGGAGTCTACCrTGTATAAAATC^ 
GA At (k TTAT^CTACaTCATF^^ 

OC AGO; .ccagatat^atcttatackjtcgcagtc 

CG GAT A TCTOTTCTAGaGATTAC^ 
^CACCACCaCTGA 

2CFE49. "homologue of SEQ. ID fJO. .45" 

at 3 ag^tatggctttga^gca^ 
capita :aaa}\gcaggagcagaggcagcaaactaccc^ 
tg 3aag ttcx1agat^acgccta<^aaaactaactgaaatgataactccta 
ca rrro latttacagatattgcackjgattgtaaaag^gcttcaaaaggagaagggct 

TCnrpGI^AATATTCGTGAAGTAGATGCCUTrGT^ . 
CG XtAQ IMGGAajTGAAGACGCCTTTOTAGATCCACTTG 
Tt< :r^ClX3ACj^AGAATCAGTGAACAAACGATATGCGCGTGT^ 

at. iaag \a7x$gtagcagaattcaato^^ 
ck :gta< :catt?gaattaacagatgaggaacaaaaggttgtcaaag^ 
.m> ag yen tatgtacctaat^ggacgaggatgtcgtttcagaacc^ 

H. AT X&TT iAATTJjtGCAGCGAGAC^AAATCCT^ 

CT< tAATT GpATCATCAAXSATAAAAAAGAGTTTCjTTO . 

GT GACiC rCGTGCAGCTTACCACTTGCtTGG 

G<T TGG> ^CTTTICAAACGTGGTATGAAGGCrccrCAAGCAGC^ 

G<£ TTA1 TtXntJCAGTAACC^TGTCATATGAAGATCT 

AG< TOGi LCGCinrGCGTGAAGAAGGAAAAGAATATAT^^ 

AA' GTCC TCGAGCACCACCACCACCACCACTOA 

■ TjCFISG- "homologue of SEQ. ID NO. 46" . 

atqgaMtogAaaaaaccmtcgtatqa 

jtfTAT|ATCGA>GCT^ 
BCAGGCTOTCTATGACAArATCAAGre^ 
rACTXpGGACTATAtroTCCGCAGTCAGATTm 
30VG|a<&AGATAGAAATT^^ 

1CF}B*1» ^'^omoloque of "SEQ. ID NO. 47" 
ATG^CXrriTAGMTGGGAAGAATlT^ 
GTG 3TAT rCGTiUGCAATAIXXJTAAGCAAAA 
" lATTpAGAGCATCAAAGA^A 
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dGGAkATTGCTGQCTTACGTGTGATCGTTCA^ 
A< 'AAG ^xWGGATATGCGAATCAfACAGGAGCGAGATTACAffi 
ff "TCC] aTCaTGTGGTAGTAGAATATACGGTTOATaCCATCAAT^ 
Ti AAA' tcGTACTTTOGCCaTGAATTTCTGGGCAACGATaGAACAT^ 
li SfrK c^GATOAGATCAAGAAGCGACTGGAAATrACAGCTAGAATC^ 
- GGG Y^ITCGTGATOATATCCAAGAAGCec^GGCA^ 
Aja&^TAGd^CAGTOAC<5ATACAGATGAAGAATACAGOCTC^ 



1 

^ 

, ^ GijiltT^CCAd ATGAC 

H; A 



/"homologue of SEQ. ID NO, 48 

^ ggaActtaatacacacaatgc^ 

J ^ ^QAGArrcCCCTAGCA 
^ ^ ^ WGCC^TACATCAGGAAAACXrra 
jCGCTTTGTGGATGTGCCnXjGTTATC 

gIg^tgattgaggagtact^^ 

;Ica HUCrcGIX^GCAG ATGATGT^^ 

GTY gcgawaaggcggacaagattcctcgtggtaaatggaacaagcatgaatc^ 
^ttaaactttgacocaagtgacgatttca^ 
c^gc atgcaatcitagaa aaattggcggccgca 



CCGTA^GAA' 
CAaOATG 1 



Q<G! 
TCG' 

A/ATXWAA 



Is 



.41 

NO ' A' 



if 



I> ATC 



•I ! 



?0*£53 "homologue of SEQ.. ID NO, 49" 

, A saaaacaagaaaaatccctttgcgcaa^ 

^ TC XiCA iTGTOAAGAACAAGQAAGGACAAGTCTrrATTQATC . 
csv * r/ffA rCAAACHAGAC^ 

l^CC VTG£ AAGT^GAAGAAAGCTmATOACGAGn*GATCGOT 
AGfrrG^AOTGAACTCGAGC^ . 

1Cpfe54 "homologue of SEQ.. ID NO. 50" 
At jTTAJU^CCCTCTATTGATACCTTGCT 

AK iACC rGCCCACGAATTOGAAGCAGGlXKXXXAGCAACTCAAGGTT^ 
C5 :aCT [TAGAAGAAATOGAAK^GGAAAC^^ 



ATO' 
TXlCiG 

t4>a< 

TOC GTGT 
TCC CAA< 



GT< 5CG7> XjCCGTATCGAAGAAGAAAAACGCCGCAAAGAA 
TG< rTAA IGAAAAAGAAGATGGTGAAAAAATTGTCGAGCaCCACCAGCAOCACCACT 



5$ ! "homologue of SEQ, ID NO. 51' - . 
TCAlTAACATtiAAAACAACGT^ 

kJggA iaaaitggagtcaaoqaccaaatcaaaaccagottccgtcaagct 

OQ' tactctcttacaaMcacagatgaaaa . 
GTOl gimtacagtcgaaaaaataggacgcatcttc 

X lATTrcTAAGAAAGTCAAAGAAATCCTCGAGCAC 



*zcms6 

AfCOC 
AG( ICGdi lGG 



•homologue of SEQ. ID. NO.. .52" ■ 1 . 

ait)g^gattoaaaattatataccagatt^ 

lgggJ^tcaaggctottttggtotatt^ 
aadgco .gagatgaagcaatxkjctacatdacot 

AAC ACQ AAAAACGCGTTCAACXjAGCAGTTGAGAAAT^ 

cqi pcac \tttggtattgaccqtoc 
cg^ ccac ctxAtcacagatatao^ 
gtc|caa^tgachcaatcaaaacgca 

vttacatataaaaaaggaattctcgagcaccaccacca 

ID NO. 53" 



homologue of SEQ. 
nTCpAAAAATTlTAATIGCCAATCGfGGTC 
GGC G AH G<»^GGtAGGGGTltATTCMCTXKTC 
GCA^rTT 3TATTGGTeCTGGCAAG^ 
CT^ACt^GG^GAAGCTATtCACCCTXKi 
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Gl SAAC lAAGTAOGTATCAAGTTTATCGGTCCATCTGGTCATOTTATGGATAtGATG^ 
TG C<9CC tGCTCAGATGATTAAAGCAGGTGTGCCTGmTACCAGGTTCAGATGGAGM 
QJ\ AtjAJ XXTraGAlTGTTGCTGAAAA 

Gl AAAC GGATTCGTAAGGTTGAAAAACCAGATGACCTCGTTTCTGCCTT^ 

CA \QGC caattatggcaatgGtgccatgtacatagaacgggttatctatc 

CA AATC GTAGOTGATGAGGATGGACArGTGATTCACrTGGGTCAACGGGATTOTTCT 

AC CAAA AGGTTTTGAAAGAAAGTCCCTCGATTGCAATCGGAAAAACGCT^ 

CT }^TG rTCG^GCGGCAGAGTTTGTTGGCTATGAGAATGCAGGAACGATTGA^ 

AA GTAC AAATT^CTATITCATGGAGATGAATACrrX^ 

TC lGGT3TTti£TATCGTTMGGAA^ 

AT \T*TC rcCTACX&GGTCATGCCATCGAGTO^ 

AG TCCA GGTAlAGATTACTAATCTCTATCTCCCAAGTCGTGGAGTTGGCTTGCGCG 

AT ^GGTrAtACCATTCCGCCTTATTATGAT AGTATGATTGCCAAAATCATAGTACACGGCGAAAATCG 

TfT iTGA ^GCCTTXiATGAAAATGCAACGTGCCCT 

GA niTC 1AGCTTGATCTCATTTCAGATCGCAATGTCATT 

A)\ CCTT nTACCTAAATATCAAGAAAAAGAACTCGAGCACCACCACCACCACCACTGA 

1 CB ssfek "homologue of SEQ. ID NO. 54" • 

Atp^T^ACAM^ 

'AG VCATCtfATGCCAGCTCAGAACITO 
ISPA 3TACAATATCGAGTaTATCGAACTCTTGTCTGAC^ 
*Jnc< JAAATTACGGAGTTCCTCGAGCACCACCACCACCAOCACTGA 



'homologue of SEQ. ID NO. 55" 
AT< SAAGDATAOATATATTTTAGCATTTGAG ACATCCIGTGATGA 
Afi< JATG \GCTCTTGtCCAATOTCATTGCT^ 

LA(3T VGCCAGTCGTC^CCATOTCGAGGTCATTACAGCCTOT 
GA' rmc< ttAAGACaACttTOACAGCTGTro^ 
GG' TTC' X^GCTGCCAAGGCCm 
GC LCCT< JATGGCAGCTCAGAGTGTGGAGCCTTTG^^ 
AG lCAG IGTTltKnWATam 
. T<3< rGGA XJCTTATGACAAGGTOK)TCGTO^ 
fc. CTC GCnX ATCXaGGGCAGGATATTTATOATTlX^ 
- CCj TCTC AGGTTTGAAATCTGCCTTTATCAATCTTC^TC 
TA< AGAu iGATTTGTGTOCTTCCTTCCAAGGAGCAGTTATC 
TTC GAQi uVATATOCTGTTAAAAGCCTAGTTO^ 
Gffl rrAG< !AACTGAAATCACAGATGIXlAATGTTATCATrCCACCTCn^ 
TAT G^TI GCTTATGCCAGTGTCAGCG AGTGG AaCAA^ 
CO AGTk TTOCCtTTGATAdCATO 



A 



^CBSfiO' ^'homologue of SEQ. ID NO. 56" 
ATC TGT^OAATTCTIXk3TGTTGTTGGAAACACAAA 
TTC \ATA CCGTGC^AlXMTTCTGCGGGAAlTTrro 
GTI QGTC GTATraCAGAATTGTCTGCCAAGACAGCTGGTGTTO 
GTTMGC AACTj^TGGOAAACCAACGGAAGACAATC 
TCfT rCGT rCACAATGGGOf GATlX>AAA^ 
CfilA AGGC <:AAACAGATA03GAAArcGCCGTACATtTGATTG^ 
S£ AqiTOTSAAGiCXriTTAAAAMGCTCl^ 

AAA ATCC AGATtoTCATCTATGTAGCGAAAAACAAATCTCCA 
TAT 3GTC raCT^GATOCTATGGCTATGATrCGTGAAAC 
3TAA KJGl^AAGGCroATAGCGTGGAA 
TA npc 3CVUCITCACTO 
TGA 3CAA OCAACTGmtGOTTAAACTCATTCAAGOT 
CCT X:TA rCATflAACKK^ 

GAG 3ATI rGCT^CTAAGAAAATGTTGOAAGAATTOACAGATAC^ 

GTG 3GGC rACGpTATCKK^CTIlCTCA 



42/60 



WO 01/49721 



PCT/US00/35604 



a. 



*<jGArAGTCGtCAAGTTTTGGTCAAGGCTAATGAAATGGGAAT^^ 
CAGGT rCAACCCTCTCACGTGMGCC^ . 
A' rCAA ITAAAGCCTATACAGCGCAAATCGCAGCCCTrGCCTTC 
G JTAA rGCTAAAKGCAAGCCmGACCTGGTTC 
C XXm C^G/GAAAGAAACCATTGAAGCCAAGGTTCGTG AAOT 
Q iTCG< JACGTWTCAAGATTACTACGtAGCCATGGMGCAAGTC^^ 
Q X3TG' XjAAGGTCTTGCGGCAGGAGMCTCAAGCACGGAACGATTGCCT^ 
G' CtTC GCTdTCTTGTCAGATC^ 

0' GGT< 5CTAAGGTCCTCACTATCGCAGA AGAAAATGTTCCTAMGATACCGACGATAtCGrcCTrACGAC 
H TAO .CCCTTACCTXTIX^CCAATTTCAATCGTCGTAC 
M CGTGGCCTCGATGTGGATAAACCACGTAACCT^ 
ACJGACCACOVCTGA 

£CI£$l "homologue of SEQ, ID , NO. 57" 
Kt GATACGTATCGAAAATCTCAGfTGTC^ 
TC CATC GACCAACAATTACCCGCATCATTGGTCCAAAC^ 
Qi TQQ( rAATTATCCCACATCMGGT^ 

AT TCJCC TATGTCOAACAAAAAATGAATATCGACTACAACTTTCC^ 
TA GGAC TATTtCCCTCTAtTCCTCTCTTC 
OC ITGAAATCGTCGGCCTAGCTGAeTAa}CTGAACGrc 
CC GGTC nGATTGCfcAGAtOTTTGOTGCAGGA^^ 
TC ACTC rGTCAGTGAGGAAATCATCATGAATACGCTGAGAOATT^ 
6* TCGT rcACCACGACCTCAGCAAOATTCCCCACTACTTCGArc 
A1 rGCCrTIXjGTCGAACAAAAGAAACTTTTACCGAAACCAATCT 
TtT tTCA ^TGGXcSGTGACCtACTCGAGCACC ACCACCACCACCACTGA 



.2CFE62 



TTC r ( 



5G 



CCfl 



2Cr€ GO 



t ^homologue of SEQ . ID NO. /.58" 
ArtKXCkAAGAAGTC 

GA rGTT :atttigtccataaggcc1tggtcw 

CG <GCC rTATATCATTCACOCTATCCAAGTGGCAGGTATTTTA^ 
GT IGCT rGTCOAXTCTTGCATCATCTCGTGGAAGATACAGATGCGA 
TT< JOTCI rrGATGTGCGGATCATTOTrGACGGAGTTACC^ 
Gfi \QGA ATTAGCGGAAAATXXATOjCAAGATCCHXI^ 

aaHctcItctgAccgcttggacaatatgggga^^ 
ac^tggaaatctatgccccgc^ 
ag: vaga ctrotcrrrccGTtATCTCAAfcx^ 

b?< :aGG ) AG03tX^GGCCmK3TGGATaAGGTA 

raaa^ jggaagamatggtcgtcccaagcatatttactcaattto 
$0bt]tgag0aaarctatgatctgattcctato 

ttacgtgcatgaatrn^ggaaaccgatcccaggtcgctt^ 
$cc^tggttAtcagtctatcx^ 

AA< :CAA< KxAAATGCACGAGGTGGCTOAGTACGGGGTTGCGGCTCACT 
GG<fGCA WITAACAGCAAGGMTC^GCTArTGGAATGAA^ 

GGC GATGATGCTAAGGAATTTGTGGACTCTGTrAAGGAAAACTAT^ 
TTTlACCC CAGAlGGAiSGTGTCCXinmnTCC^ 
TAC CAA< GTCGGTT5AAAAAGCAACTGGTC^X2AAGGTCAATC 
AA> GACi LGGGGATCAGGTTGAAATrAltXfcCAACCCGA^ ' 
ATC GTGd AGACTAGGAAGGCGCGCAATAAGAnt^CCAGTTXnTTAA 
GTqAACyUGG^atGAGATGCTGATGGC^ 

_ i ^WTGGATCAAGTIXnt^ 

TGGfTTITflGGGjiAATCGGTG^ 

LGGOAAGGCTGAGGCAGAGGAiKnTCIO^ 
AACKttCAACX&CAAGC^ . 
AAC IGTT jTAACCCCGTGCCTGGTOACGATATTGTrc 
ACC 3TGT 30AC|IGTATGAACCTGGGTGCCCAAGAAAACTACGAGG^ 
AQA CCAG TACTCTAGCTCAAATAAGGAGTATATGGCCCATATCXjATATCTACGG 
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<^C?nbAAciGLATOTACTGCAAQTTCTT^ 
qCAAC GATi^lXSAAGTTIXjCTAATATCCATC 

A rAAAATTAAGAOTGTGCCAGAAGTTTACTCTGTCAAACCGACCAAC^ 



crcG^ 

2C$EU "homologue of SEQ. ID NO* 60" 

aksacagaagaaatc^^ 

r iAOA jGCTQfTCGTATGCGTCCAGGQATGTACAT^^ 

-ji rrocj jaaa^ttgataactcaattgacgaggccttc 

Q ^GCC\GATOArrcGATTACTC 
G [CGT X^TGCTGTTGAGACCOTtrmACAGTCCTT^ 
G 3TTT<^GGTGGTCTTCACGGGGTGGGGTCGTCAGTTGTTAATGrc 
P :CATL .AAAACQGTAAGATTCATTACCAAOAATACCGTCXjTGGTCATO 
IX H3AQ ^TACdGArAAAACAGGAACAACTCTTCACTTCACACCGGA^ 
A iTCTTGATtnGATAMTTAAATAAACGGATTC 
C WTCi CTOATMaCGCCAA<kimGGAACAAACCAAG<Um 

c< rridj atatatcaacgagaagmggatgtaatctttgataca^^ 

(?J kTATK JAC^GTTGAGGTAGCCATGCAATACACAAOT 

A' TltlX atacacmtgaaggtgcliacgcatgaacaaggt^ 

Tl ATGC TCGTA AGAATAAGTTACTCAAAGAC AATGLAAGACAATCTAA 
<S TAA< TGCaGTTATCTCAGTTAAACACCCAAATCX^ 
AJ JACl( !GAAGTGGTCAAGATTACCAATCGCCrCTTCAGTGAA 
>V 0< CA& TTCO^AMCGTATCXJT^^ 

tt oc gtg> agtca^cgtaaaAaatctggtttggaaatto 

It TAA^ AACttSTGCTGAAACAGAACT^ 

ClCOT^ACCXSTOAGTrrCAGGCT^ 

<JC ATAJ G ATTCtAGCTAACGAAaAAATTCGTAGTClTO 

XT rGAT 3TTTCGAAAGCCCX3TTACCAAAAACTC 

tt 0>ta xcttcttttaacctk^tttatcgttat^ 

oc 2aa$ saccaatctatggtgtcaaggtlrggaagcgac^ 

aa qaa> tcaaactccaagaagcittagccxgtta 

ta \ggfc gctaggtoaaatgqacgatx^tcagctgtc 

cg 2tac agttkntstagatoatgctck^gaagcagataaaat 

GA 3CCt XSTOTTGAGTTTATCGAAGAAAATaCTarCT 
AC^CCXCTGA 



So 



2 CPiH 



'homologue of SEQ ID. NO* ; 61" 



ATI JGCAtTTTACTGA^ 

TOi kCAO iTGTjrrATGCTTCQttGAACGATAAG 
GA JT^G WUCCCTGCCTACGTTCCTOT 
GA IATC JTtX3AGGAATTGGTTCGCTACTATCT^ 
Aa^CCjACKty . 
. * • * . •■■..*■ ' • 

aCWW i.nbmolpgue of SEQ. ID NO ^.62" 
ATQGtC^AACTATCCAGATAAAGTITC^ 

AGGAATGTCTTTTGAAAAGATGATCAATGCTACCAACGACT 
TOTtTAiTA CATAAGAAACC^CTCCTATT^^ 
ATT GTTG /^GCCTATTfTCGACAAGOT 

Cltr TQAk GTCAA<3GAAACAAAACAAAAAa5ttKiGATrCCGATC 
,<*V CA*7 ATjGC AACAAGlXXnTGCOGAAGAAGGAATCTGCTTO 
CTA CTT4 rTGCOGGCATTC&AW 

AA1 ATAT FCGA 3AATATGGATATQAAATCAAGGCTCGTGCCCT 
AAAQAA^ATTTt^ 



2CFE>7 /"homologue pf 

■II.- I ■ ? 



SEQ. ID.NQ., 6^' 
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ATfeGCtbTATrtAQTAAAAAAOATAAGTATATTCGAATCAATCCCAATCGlTCCGTTAGGOAAAAAGCTC 
M 3CTMGCciGAGGTTCCAOATGAATtATnTCCCA 

GG Sc? 3^AAGTQAACGTATCTGTCCGCACTGTAGCTATACCTTTCGTATTTCTGCCCAAGAACGCTTGG 

"|A< X3ATTGaTATX3GGAACCTTCAAAGAATTGTTTACAGGGATTGAAAGCAAGG^ 
CCfcTCG rTACCAAAAGAAACTGGCATCTATGCGTGAAAAAACAGGTCTGCATGA^ 
AA ; TCC rCTrATTAAAGQTCAGACTGTGGCTCTTGGGATTATGG^ 

CT, ICGG [TGTAGGTGAAAAAATGACrcGTTTGTTTGAGTATGCQACTGTCGAAAAATTGCCAG 
AT CAC .GCCTCTGGrGGAGCCCtfTATGCAGGAAGQAATCATG^ 
^ GC WJCG 3TTAJ\ACGCCATTCA^TCKHXXJTCIC^ 

*> cm ;tga ^gcttctttcgctatggaaggcgatatcattctggctgaaccacagag 

GG 3 crt ^GTGTGATTGAAAATACGGTICGTGAAAGCTTGCCTC 
TA< JAAC vtgGCTTTGTGGATCCTATTGTCAAAAGAAGAGACTC 
AF XSOf GGAGGGAGTCCTAGACTCGAGCACCACCACCACCACCACTGA 

£ Cfflk6$< "homQl£idue of SEQ.' ID NO. 64" ■ ^ " _ ju u -ra 

ATi JbAKTTVflGGOATIW 

CA< 3Ctd u*GGGCTTOAAATCATCCaGATAAATGAaG^ 
AA' TOG' TX?AtACTTACAAGGTXKJAACGATTrGTAGTGGGCTTC 

AC( IGCGl IGTAGAAGCTAGTCAAGCCTACGaAGCAAAGCTAGAAGAG l I J 11 iCOTTTACCAGTAGACTAT 
CA.< iGAT JAAOGCTTOACAACAGTGGCTGCTGAGCGCATGTTGATO 
AA.< 5CGC lAGAAAGTCATTGATAAGTTAGCAGCTCAGCTGATTTTA^ 
TC< AGCaCCACCACCACCACCaCTGA 



o 



2gF1I69« "homologue of SEQ. ID. NO. 65 . 
An I ACAMACTTACTGTTAAAOACOTTGAm 
rA ( CAT! OAAAGATGGCGTAATCACTAACGATAA^ 
CA: CGA> CAAGGTGGACGTGCAAlTCTTr^^ 
GG' 'AAA' CACT^CTCCTOTAGCAGCTGACTTGGCACK^ 
CA< TCGfl GGTGCTGAATTGGAAGCGGCAATCAACGCTCTTO 
AC CGin AiCGAAGATCTTGACGGCAAGAAAOAATCT 
TC/> CTTC GAGATGGTATCTTCGTAAACGATGCATTCGGTAC& 

TGTC AGCAAACGTTGAAAAAGCAGTTGCItKJTTTCCTTCn^ 
AGciAtTT GAAA^GTCCAGAACGTCCATTCGTtoGCTATCCTro 
GTIATCX AAAACTTOCTTOAAAAAGCltiATA^ 
AAC fCA<i AGGTATCGAAATCGGTAACTCACTTGTAGA^ 
TO/ AAA; GCAAATGGtAAATTGATCTTC^ 
OA/ GTG< CTGACACTG AAGGTGAAGCAGTTTCTGAAG 
TCC3 CCA/ ATTTGACGAAGCTTTGACTOGTGCCAAAACAGTI^ 

aa/ ccti gatttcc/^gctogtacaatcggtgtgatggacgct 

AA1 CATC GGTGGTGGTGACtCAGCTGCCGCAGCGATTAACCTO 
AG7 AOGt^(X5TGGAGCAT^ 
AA/AAd 



dtCGACK^CCACCACCACCACCACTGA 
4 CFEfrO 1 "hgrnologue of .SLEQ. ID NO. 66" 

atg ttaaIaatcagaAaj^caatcacgttatcaaatgttaaa 

AAA CCAfl TGTTiTGGCTAATCTTTCCAACCKX^ 
OCA GGCI TTTATTTGTIXX3ATGGAAAGGAATTC 
TAT IXJCA ^AGGCAAGCKJIXnlTOTGGT^ 
AOG AOCT ATGTOAACTATATTTCrrGTXU 
Alt; 3TCA 3TrAtfTT6GAGTTCT<K3ATCT^ 
TTA [TTG< jAAG^TTTGTCGCTATlTrGCTrc 
GCACCAOCAGCACQAOCACTGA 



0- 




acF#l' "homoj-ojgpe .of. -S5Q.. ID NO. 67 
ATG \QAAfrO3AACTATIX3ACT0GCTTTA 
AAA^qiTX^^ 
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fee J**) 



it 



TACGGftAACAGTTGGGACTGGATCAAGTK:^^ 



, im, "CATCCCTGAACA.CCATCOTCTCAAGATGCTTGAGTTGGCAAnXiAGGGGATTG 

li fAAT 1 ! t^GATACGGATTATTACTTTATCATTXGTGCCGACATGGTTGACT 
. a) XXOi TCAACIGGTTGACATGGrrCAGTTTGTC 

AC iGrc(SAAACCCMCTTTCTCCTACC^ 

G4QCA( cacxjagcaccaccactoa 

a<!lte% "hotnologue of SEQ .'. ID NO. 68" 

^GAATlATTCCAAAAATAQTCAOACUAGCC^TQAGCAG^ 



CCCAT 



TAT 

ccr< 



^' .Qi 



cr>> 



TCAI 
CAO 



GGCTA? A< 
CGM 

£CFS75 



TTGGrTGGfcTTGGAGACCAAGCrGTAACAGTGGTrGGTATCCAAAAAGGCAA^ 

rdA a * cggaattttgoccaaccacatccag aaqgctaccg aaagoc^ 

tga^ATTTGCHEGTCCAGTTCTGACC^ 

W ACG1 GCTCAAGGGGAAGCTATCGCrca^TCTCATQOAA ATClAGTGACCTGAAAGTTCCTAmTCG 
nVlTCGOTGAAGGTGGTTCAGGCQGGGCTCnXjGCTCTAGCTOTC 
TTCTATCTATGCCATTCTCAGTCCAGAAGGCTTO 
C^TG^VAGCAGCAGAACroATGAAAATCACrrrCGCATGAACTC 
TG \TTTrrOMGTAC5aACTrrcTAGTAAAGAACTAATrAAGAGTGTCAAAA^ 
'"^TAf ACTITCACAAAAACCGCrAOAAGAGTTdCTGQAAOAACGCT 
GC^CCACC^CCACCACCACTGA 

homologue of. SEQ. nTNO. ?J" 
ArGTCAGATAAGAnGCCTTATT^CAGGCl^TTt^ 
AA 2GGG CC5A^CAGACTTTTTGATAAGCTrrATOTGGGTATr^ 
££ <A TC < jAAAATOitAAACGGGGCCTAGAAAAGGCTTT^ 

7C 'CAT( ^TGAATTGGTGGTCC3ATGTTGCAAAAAGATTGGGTt3CTACTrGTCT OAGGA 

Alt JCGTi xkjatttgcaatatga\agccagttttoattact^ 

AT TAT1 TACATAGTCXSACCIX^CATtnCTAf ATCAGT^ 

GGA rATTGCCTGCTATOTTCCCGAGAGTATTIG^GGAMGCX} 



(3CACXJA 



O.C¥t16: "homologue. of SEQ. lb ( NO. 72" . . '. 

MATTfrGTTTCTGGTTATCAGTGCTT^ 
ACAGCTi iAAAGTGAAGGAGAAAAATTAGCTC\GCAGTATGCAGGATTAGAGCAGGCTCATCAGGTrGAT 
TTA TACV ATOGCTraGAATCTrATTACAaCGTrcTtGGTCGTAATAAAC^ 

CJA' TCG' AAAGATOACX^TMCATrrACXntTATC^^ . 
AA< :COT TCTAAGGAAAAGGGAOCTGGCOAOATTGACAAOATAACCTrrGGTC^ 
AA' CKH GTAGTTAAGTCAGGATCTGATTTTTATCTAGTAGATTTTCAAACAGGAGC 
GA< iGG6 TACfCGAGCACCACCACCACCACCACTGA ' 



"homolagvie.iaf .SPQ.* ID NO. 74 , 

tBtot^caatcgataaagaaaa^tttcaotttg 
_gc cagc atattcitactggaaatta 
ggc aat( trggtagccatcatttl^ataagtttcatct 
xc/ qcaa ggtaaacgactttagtgttcgttatatc^ 
^gctcmgacggtgtctggt^ 
ggttatcgcjtgttttrotoggtckit^ 

"ACAACGTGATCTXJAAACATCOCACC^ 

cG^acilCKJATirCT^ 

OTG KSCA AATC1TGCGCTATCGTGACTTGGAATA 
GaU oATI OTTOCCAAAMTATCATtKCTGMTTGGTATCT 
CM SCTT fATCTCATACGAAGC C rix;! UCL rCGGTCTTGGAn'ACXGATTACAGTGCCAAGTTTG 
GOT ^TtfGATJTCGGATTATTCACAAA^ 



A 

CGd 



ft TAACGGTjAAA' 
x CACTGA' 

ATG ATGtiAJV 
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2 CF6 7? (c****) 

,^ct TGT( ITTGGTATCCTTGTCCCm 
* ATAGAQrCGA^ACXiACCACCACCACCACTGA 

^ ecpr79 "homologpe of SEQ. ID NO. 75" 
0- AT( rTATiACCTATTATTAACCAlTlTAmGTATTAT^ 
^ AC< :AAA UCCAATCCAGCAATGTAmGATGCCAGTTCAGGTGATTT 
l£ Or TiTGV AGCTGTAATGCAGCGTTTGACAGGGATTTTAOTCT 

AO fGTij fTATCAAGTAGACTCGAGCACCACCACCACCACCACTdA 

2.CF1S0' "hopiologue of SEQ- ID NO. 76" 

ATC itttc otagaaataaattatttitttggaccacagaaatt^ 
AGi ICAG \tgg<1atctttgattaacccttttgttagcgtgct^ 

gg< Jens' CTTTATTATTTGACAAACCCTATTOT^^ 
TG( rTATf TTM^ACCTTGrGTACTTTCGTCTGKJ 

GA' TAA* rAGTTATCTAGTTTOArrATATCTAGTCAAACTAmATAGTCGAGTA 
AC' T^VTC TAATTATCCTGOSCTCCAGAATrrGGATGTAGAAG 

GT1 gw >ttcttcaaaatatcctaaatagcgtatcaaatAgtc 

°" TA< TGTI TTGA[rrTTXSATTATGACTCCAGlTrn^ 

JK ATX ICTTC AAAC^AACGATTCTAAAGAGGGATCGCrrGCATATTGCAGGCTT 
IN lTTG( TCGGTATAmGTGGAGriTCGATTGACGCAATCATrArAGGTTO 
^ Ad' ATT/ TTGGTTTA AAATATQCTlTAGrrrTTGCC^ 

GG( JCCA LGTAlTGGTTTGATTCCtATGATGATCGCAAATATATTC 
. AG' XjAt TATATOCTTGTTGTTCAGCAGGTAGATGGCAATATC 
TO J AbQ TdATCCAATCACGATTTTAGTTTTACTTr^ 
ATTGTCC CAGTGCCAACCAATtCTATCTTGAAAGAAATTTCT 

taj AATi atg^aagaacoagaaagagaattagctaagctc^^ 

ZCBin "hqraologue of S£Q. ID NO. 77" 



Oo 



AC TO 



A< 

GAd' 

GA^Afc}fllAar< 



GGTGGG 



CIA AGA( nrcnjAAAGAAGCGGTGGAGCAA^GAAAATAAGTCAC 
AAQGGG LAAMCCAGtGTTCCTAAAATCTTTGCCAAGGCTATC 
~ 'TTOC AATAACTGCTATATiTGTCAAGCAGTGACG<^ 

GCT] CTAATAAtGGGGTAGAT GAMfTCGCGAAATTOSTGATAA 
ClC QfTA FAAGGT1TATATCATAGATGAGGTTCACATGCTGTCTACA 
ACC CTGjC AAGAACCAACACAGA^TAGTCTTTATm 
CT^ TTCT iTCCCtoTGTGKZAACGTTlTGAGTITAAATC^ 
TAX ATOI rAGAAAAAGAAAATATCAGn'CTGAACCAGAGGCTGTC^ 
GOT OGAj XGOTGGACGCCTTGTCTATTTTC 
CTG CTA1 STCTGAAGAAATIACTGGCACCA^ 
^ CAACAG< ATGTrCCCAAAGCTTTGTCTTGCTTGAA 
H: TGTjOACC ClATCTITiXKiACTATTTAAGAGACnTGCT 
" C TCTTjrGTAGAAMTTO 

A CKIA^TaTTAAGTCTaGTTTGC^OCCCAAGATTTAT 

* AGTCCGMCCAGCTCTATCAG<^GCGGTTG^ . 

! TC ICAMCAaGAGCTITCTAATGTAGGT^ 
CTAC GGGCAAAACAGTCTATCGTGTCGATCGCAATAAA 
AAA TCCT GATTTAGCACGTCAAAATTTAATTCGtTTO 



j: CGG^pAAGCTCTGCCTCGAG^CCACC^ 



i ■ H ■ ... . .. 

ACBEjH "homologue of SEQ.. ID NOl 78"" 
ATC ITO iAT^CCAAf AAG 

AATXGT3GAG6AAGAAC^TTCAGGCTAGACT^ 
^ ATTATaj TCTCTATGC^^ 

^ GGG CTTG WGAAGCGT^TCTTATCAGTATGACCTTTAA^^ 
GAG CGGC TATC^GTATTGGAG^C^ 
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AUGTTf^GCiroGTTGCTACCTTCCAQAGGAAAGTTQTCAmCAGTGCm 
CC AGG< KTXUTGTTCCTGAATGCCCTTttf AATCGCCCGTATGA^ 

. CT/ GTd< rAGAGAMAAAGGT CGCTTCCTTCCTCTCCAAACCA7TCTTGGTTCCATM 

cr/ tta; ctkjcccttacggtaaaagatcctcttacagccttaacaaccttcw 

TA1 Clrn OGGACTTATCTCTTGTrTAATGCAGGGATTAGCGTrrrcCTCCIAAA 
A^LTtAdCAATCAAATAACCTCAtATCTG 
^ Ca XfM K:CACCATCGCTATCTTGTCAACAATGGTm^ 
^ AT. CCTt AQAAAGCTTTAAAAAAGTIXJrAAATCCTCATGATTTOKSGOTTTCAGGGG 
E A& IA0A nTGGACAMCTCTTGAGCCAGTTTGCMGTGACAAAGGrTATAAGATTAAACL\AA^ 
\A *r TCGlTACACnACrrTGCTGTTGCGAATCAAGAAGGAACCAAGl^^ 

M< XXJT|(JTCCAACCAAAAACAGrmCATGGTATTTGAC^AAAAGATTATGAA^ 
AAinrcmrrCTATCAaGAAATG^GGTCGGACTCrrTTGCAAAG^ 
TA; CHS AAATGATCATCAATTTTCTGTAAAAGAAGAATTTACTA^AGA^ 

.' X2AG' TTAATATTTTGACTGCTGATrACAATTACCTTGTTGTACCTGAriTACAAGCC^ 
: CCAC ATTCGGCTAlt^ATAATCAGTTTTACGGTGGTATCAATGTAAATGCCAGTGAAG 
^GqJ^CTGAGGAGTATGAAAAATACTTACAAAAGTTTAATGCTC^ 



ON 




A|GGTA.GCACTCTAGCAGATGCTAGTGCTCAGATGAGTGCCCTCTTT^ 



4 an 

ATC 



o 



Q 



S3 ^ 'homologue of SEQ. ID NO. 7.9" 

A M ; ^aatcacgcctatctcttttg^ggttt^ 

AG ClM ITrCTACGGATAAAGTTGGCGTCmCCATQKUaAAATGCCGAA^ 

AA] TOffl OCKJACAGTrrTCTCAAGCAGGOATrG^ 
A> AA* tX^TCCCAAOK^CiC^ 
* TTjT^ITtUCTAtKXJATGAGGAAAAGATCTTACOaAC^ 
^^'^^GAAAAACTTATCTTACmOT 

. 5 CJraM i"homol6gye of SEQ. ID NO. 80" 
* S £A£nAA^TTTCTOAAAA^ 
COG lAW rnTrrrGOAGGAAt<nTOGCGTAGAAGCW 

atc ntr rroTTCTTAAGqAaarx^ArraAC^^ 

^QOAtATCCmMAGCtKXJTOATIG^ 



CA.C M3GC QAAGCTAAATTCXX(^TGG(XAATCACa:GTATCGG, 
CACZACT JA 



AACATTTCTCOACjCAC^CCACGIC 



rj" 2- era IS "homologue of SEQ. 



ID NO. 81' 



rGOTCTATTTGGTAAAAAACC 
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XCAVGTGGATATTCAAGtGATTAGTC^ 
CG' ETCQ jMAAAAATCAATGATTTGAACGAGCCTGTO 
CA PGTG jTTGATCCTATTAAAAAAATAGAGGAAGAAGGTCAAGGTATTTCT 
AT< 3TTAJ UVACATOXAAGAC&TGCCAGCAC^ 

TT< !AAA' rCGAGGAAGCGATOAGGGA AGAAGC^GGCGCTGATGACCTTGAA^ 
AA \GTC lAGMCTAGAAGACTTGGGCTTGAAAGTTCLVAACGM^ 
AA jTAA rGGC^ATGTTCAAACGATTATTGATGAGATGGATXTITGAGGCT 
CC( 3T , fiO' "AGCATCAATCTACAAATTGACACCAACGMCCAGGTCGTATTATCC^ 
TTGAAG }CCTTOCAACTGTTGGCTCAA 
CA- VTOTl yOlT^TTATGTOGAACACGGTCCAG 



or 



V 



^ 2.CFpl8 



TGjC iAAGAAGGGCGCAGTCATAAAACAGATCCAATGTCA 



AIT ATT] CACGTATGGATGGCGTGACTAGTTACTCTGAAOGTGATGAGCC 
AricAO ^CtflGAGCACCACCACCACCACCACTGA 

S-CFjfcW t, homologue of SEQ . - ID NO. 82," 

A^TTTTQCCATTATTrTAGCAGCGGGTAA AGGG ACTC 
GclcM XjTTGCGGGTaTTTCTATGTTCGaACATGTTTTCC^ 
AC> iGT^bAGTTGTAGGACACAAGGCAGAATTGGTro 

, CTGAACAGTTGGGAACTCGTeATGCAGTTATGA 
[i ;CTTGGTCATTGCAGGAGATA 
colrATt AATCATAAAAATGTGGCCACTATCTTOACTGCT 
AH GTTC GTAATGAGAATGCTGAGGTTCTTCGTATTGTTGAGCAGAAGGATGCTA 
AJU OTCAj i.GGAAATCAACACTGGAACATACGTCTTTGACAACGAGCG 1 1 iXJ TLIGAGGCTTTGAAAAATAT 

oG ,TAC< aat^cgctcaaggcgaatactatattacagacgtcattc 

AAAGTrt iGCGCJrtATACTTTGAAAGATTlTO 

CTG/ k GTC\GTTATGCGTCGTCGCATt^TCATAAACACiTC 
AGAAG& U\.CTTATATCGATATTGATGTTGAGATTGCTCCGGAAGTTCAAATC^ 
AAi lGGCj lAAACGAAAATTGGTGCTGAGACTGTTTTGACA^ 
GCi <GCA< jGAGCGGTCATTACCAATTCTaTOATTGAGGAAAGTAGTGTTGCAGA 
CCI TATC CIX^CATTCGTCCAAATTCAAGTCTGGGTGCOT . 
AGC ! ATCT TCMTCCGTGAGAATACCAAGGCTGGTCA^ 
aac \qtt/ ATITICGGTCCTCKJAACTATTAGAGTCAACTATGACGGCA^ 
GAC fACA. iTGTCTnTGTTGGTTCAAATTC^ 

GCtl GCTC GTTQAACTATTACTAAAGACGTGCCAGCAGATGCTATTC 
ATA AAqi lGG AATAT^AACAC6TCTTCCTCATCATCCT 
CTGA . ' ' _ 

2.CJFE87.- "homologue of SEQ. ID NO." 83" 
ATC TCC^GAtlCTAGTATTTGGTCA^C 

CTA CCTT 3CAAAAGAAGCTTACGGTTTGGATACGGAAGCTGTTGCC^ 
GCC TTTG rcrrGAACTATTTTGGTCTGGAAGiCAC^ 
AGC AAG1 TATCTTGACTGACCACAATGAATTCCAACAATCTG 
CQC TOTT STAGACCACX^OOGIXmKSCTAACT^^ 
GTT jGAl CAGCGICrrCAATCGTrrAOOGTATGTlXIA 
CAC GlfT 3L\TGlCTTTCAGGTTTGATTTCAGA 
AA> ATC/ rriG^IXXTOAATTGGCTO 
AAC CTGC TACCAACTTOOCTaGCAAATCTGGTGAAO^ 

caa c6g> aataatgtccgtgttgcc£aAgtgaagac 

G AA ATT{j AAGCTGCAATOCAAGCTGCGAACGAATGAAACGGCTACTCrGAC . 
ATA rCQt -AACpCAAACTCAGAAATATTCGCTCTTGGTGC^ 

ttt< *AAA tTtgAaaacaatcatgccttccttc 



7TQA AAGQhTAATAaKrrcGAGC^ 



homologue of SEQ. ID NO. 84" 
ATphTTTCAAAGAGATTAGAATT^ 
GTGKCJCA lXKnyATCnGCCTAlXX3AGTTGGTTC 
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CO^GGAilGGTCCCTATCAGTCTGCGGTTAAAAATGTTGAGGCT 

OX irnw GCCaATOGCTTGGCAGCITITGAAGAGACTGX 

GG' tiGTl :OTITGATTGCTAGGATltrAGAAGAAGGTTrGGGGMGmGCT 

TQt AGC( ICAATMTCGTGAAGACGACTTGCGTAT CTGGCT AG 

M< 5AAT ^TTAGAAGAAGCTGQAAAGTTrTATGAQATTTTGOTGGTGb 

AGl :d^G' rGATGTTCGCTTTGGTCGCTTCTTGTCCAAAGAAGT 

AAj IGAA 3CTGAGAAGCTAQAGTTCG(XCTCGGACAAATCCCAG AAAAAAATCTGGAAGAACGTCAAGTT 

CT7 >GTA< * ata aGATTCAAGCTATCAAGG aggtgctccatgttagcaagctcgagcaccaccaccaccacc 

AC1GA 'I 

•on»9 



ATih 



"homologue of SEQ. ID NO. 85" 

iAATfTAA/LCGATATTAAAGACTTGATGACTCAA 
JfiQi K3ACGGATGAGWGCAGTTTAGCAAGAATG^ 
XC( CTCX AGCACaxnTCTAGCAAC^ 
AOJ lAGA VGTTOCAGCICCAGCTGLAAGGAAGTGTC 
GGJ lGTG( rTTTACTIGGCTGCTGGACCAGATAAACCTGOT 
V$T GTC AAA< :ATTQGTAATTATCGAAGCCATGAAAGTCA 
AAA :g£a UVTTCTCGTCTCTAACGAAGAA^ 
AA< ICfaCi tACGACCACGACTGA 



A7C AAA 

tttiggj* 



a 

* A' 



"homologue of SEQ. ID/ NO. 8.6" 
■(JtTA^CGAGTaGTGGT^ 

\& aTaGTTTAGCAACTGKjGaAAATCGGCATTGGT^ 
ATdtGC/ TAATGCGGCAGAAATCCAAGATTTTCCG 
TTT TGAf \ACTATTCtTTATATOOCTTGTATGCAGCCCAAGAGGCTC^ 
ac)c cici TAATAGGGATCGTTTTGGTGTTATCQT^ 
T CA QGTfi CTTCGCCTTCATGAAAAAGGACCCAAACGTGTCAAA 
AA7AfGqCTTCTG0QMTGTAOCCA 

.TGATGCGATTXKjGGATGCCTTC^ 
TGdOAG<tMCAGAAGCrrCTATCACACiOT 

OTTCiuTTCTAGAAAGTCtTOAACACGCT^GAAAAACGTC 

TGdm&GAXATACTTC^^ 
GCcUTC^L\GTAGCCTrGGAAGAAGCT 

ITGcdAATGAAAAAGGAGAAAGTGGTGCTATCXn'AGCTCT^ 
!/&GTCrmACAGGACATTTG^ 
GAXGfrmTCCGtGATAACTTTC^ 

TCT rTATGGACAAGGTTTOGAGAAAGAAATTCCATACGCTATT^ 
CACjAATqCAGTTCTTGCTTTCAXACGT^ 



CGJGAACTCC' 



£CrijM' ( 'homologue of SEQ. ID. NO. 87" • , 
ATG}MC/|TCTATGATCAAC^ 

TG FGGTTTCAiGACACCAAGCGT^ 
TAaJtA^ CTAOpOTGAGTATAAACAAGTCCTTCAAAATATCGTCGATGOOT 

GAt(}GGGACTTGGAAGAAATGGCCAAGCAAGAACntIAAAGATGC^ 
AAT ATGA AGAAAAACTOAAAATnTGClXXTTTCCiM 
AAT XGT jGAGCAGCTGGTGGAGACGAAGCGGGACTTTTCGCTGGAGA 
TAT 3GCG AAGQGCAAGGTTGGCGCTTTGAAGTCATGGAAGC 
AACflGGj TGCTATGGTTI^G<SrCAGTCTGTATA 
X?TG rTOCTOTCACAGAMGCGAAGGCCGTC 
AAtflTGAAG^ 

TGG TGO^ CAGAACGTCAAfAAGGTTGOGACTGCOGTrcG 

rAp KTQC AGGi^GAACaTAOCCAGCAGAAGAA<±aCX}AGAAGGCt ATGAAGATTATCCGTGCACGCGTC 
GGT jACC ^CTTTCCrCAGAT^ 
ACP ITOUlAACGGATCCGAACn 



50/60 



WO 01/49721 



PCT/US00/35604 




cqtcc/aaaactagatacgaititgtctgg 

y lAAAjjiACTAGAAGAATTAAACAAACrCGAGCACCACCACCACCACC^ 



o 

^ TA 



AAAAAI 

Abrcc/ 



•2<Ai9l\. ."hqmologue of SEq. ID NO. 88" 
Kl GGCOTAWCTCTrAAAGCT 

AC ACT1 ACGGAATTG AATACCAAGACGAGATTATCATC^^ 

CI TGCl IXIGTATCC^CTATOTCATTCCTGACTACTCTTACA 

71 ITAA rCAeACACGGACACGAGGACCACATTGGTGGGATTCCGTTCCT 

TA T7TA rGCTGOACCGCnTGarrTGGCTn<} 

CAAJ CTTTACGAAATCAACCACAACAOXSAGTra ^ 
ACTC ACTCTATTCCAGAGCCTTTGGGGATTGTC 
<W CTH \AGTTCGACTTTACTCCAGTTGGAGAACCT^ 
AA GGCC TGCIpTGTCTCC^ 

CG nO<S rcAGTCCATTATGAAGATTATCCAAGGTATTGAAGGACGTATCATCTTTG 

/JT ATCT pocgtctccagcaggcaacagaaGctgctgttaagactggacgcaagaw 

TJPOTAT JGAAAACKJCCATTGTCAACGGAATCGATCTTGGCT^ 

OA occ^ aatgaaatcaaagattatcctgcaggagaAgttcttatcctc^ 
cr vrbG ^gccctctctdjTatcgccaac^^ 

PCn[< TCTljcTAGTCCCATCCCTGGA AACaCTACTAGCGTCAACAAG CTG ATTAACaTCATTTCTG AAG 
CrpOTG rCOAAGTTATCCACXKJTAAAGT 

C rCATbCTGCGCTTGATTAAGCCAAAATACTTGATGCCTGTC 
rCCA X}CTGGACTAGCAGTGGATACTGGTGTrGTCAAGGAC^^ 
GT 3CTTI JCCCTTACTGCTGACTCAGCTCG^ 
AA VTOG rATCQGTGAAATTGGCGCAGCTGTCCTCAAAGA 
CTAGCA JTCGCAACTGTTGACTTCAAATCGC^^ 
TT< ?rCT> CATGAaAGAGTGTGGAGAdrrGATT^ 

GC, \CTQ UUAATAAGGATGCTAGTOTGCAAIXrrGTCAA^ ~~ 

TC' ATGi LAAATACCGAACGlXxAACCGATCATCATGCCGv 
CC^CCA< X^VCCACCACTGA 

2.CFE94 _ . . . 

AT<jK#^ 

AA< JGCC lAGACGATTGAAAAATATCTAGGC^GAAACTAC 

GA* TTO> AGAAATCCAGTATGTCCGTCGtATATTGAAAATAATTATGAACCGCAATATA 
GAj lAAG jCCCTCTTATCAATGACnTGAAAAAAGAAGCT 
ACC CGGi LCCGTOAAGGAGAAGCGATTTCTTGGCATTTtK3C 
Cjy GCG' GTGCnOTCAATGAAXTCACCTUGGATGCAGTCAAA 
GA1 AtGWfACTTGGTttoATGCCCAACAAGCTCGTC 
CIV TTTT STGGjiAGAAGGTCXAGAAGGGCTTCTCAGCAGG 
CAT TGAC CGTGAAAATOAAATCAATGCCTTCCAGCCA 
AA( fGGA ^CCaAACAATTTC^TGCITCCTT^ 
AAC GAA< >TCAAGGAAGnrcnxnCTCGTCTG 
GAC!CGT> AGCGCAATXKrtSXnTTACCCT 
All rCCG TACTtXiAAAAACCATGATC^ 
TGAJAGOi TTGAJTACCTAIATGCGT^ 

ITACtKJATCGTTrrGGTAGCMGtATTCf 
W TGCCCATOAGGCTATTCGTCXX^ 
GGWCAA( iGATCAGCXCAAGCf ATATACCOT^ 
GTT nta ITACC^TTOGCTOTTAAATlteTCTCAAAAAGGGGTT^ 
GTT [XJAt JOTTATCTTCO^TrrA^ 
ATC FGGT 2AAAfc^GGTCAATACK3AAACCAGAGCAA 
AAC \CTQ ATTAflAACCTTA^CWAAAATGGGGTO 
CAT PCAQ lAAC}ymmTCTT(X3CCTGGCAGCCAAACG 

AAT vagc ^t6gttgaatatttcccagatatcgtaa^ 

TGG VTGA rGTCGAAGTCGGAAAAGACk:AGTGGCGACGGGTX^TtGATO 



^ AGdnidA 
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CCA 
^ ACC. 
V CQTG 

\£ GCT 



GAT 
GGT 

cm 



k AAG" TCCOAAGGCTGAAGAAGAAATGaAAAAAAT^ 

<K:rCpASCAC|ACCACCACCACCACTGA 

9 rm<)< "homoloaue of SEQ. ID NO. 91" • 

it ™ WrtCATCAGTGCTGGMTT^^ 

TO 3GAC rCCTA^AATGGGAGGT^GGTTTTCnTGA 

ag ffi SinKcKo^^ 

SSraTGGAGTTATC^ 
u ct Sat TCG^ATTTITCT AT AT^ 
* AC GAC ^GTOTTGACG^ 

' TP ScJ ^GGTCAGATGGATATTCTTCTAGTGATTXJrrGCX^ATGATTGGTC 
TA ICCA rAAGaWcAAGGTCTTTATGGGTOATGTGGGAA^ 
li Ail TCIV /TGGCTCTCCACCAGGAATGGACTCTCTTC 
TO' TATt ^tqcaAGTCAGTTATTTCAAACTQACA'G^ 
Afe ^ TOAQOTXiGGGGATTCTCTOGTAAAGGAAATCCTra 

TJX iGGG uSnSoiCnWAOCAAOTC]^ 
ACI^CCiCTGA; 

"homoloaue of SEQ. ID NO.> 92" ' • ' • " 

AT<iGCAfcGCGAATnTCACTrGAAAAAACT 
cL ^AA inrACTOAGCGTATTCTrTACTACACTGGTAAAATCCACAA 

Gft !ACA, LArGQACTGGATGQAGCAAGAGCAAGAACGTGGTATCACGATCACATCTGCTGCaACAACAGC 

to atg< i aacaaccaccgcgtaaacatcatcgacacaccaggacacoto 
■ ac< irrc cttogtotattggatggtgcggttaccgttct^ 

AAaCAO [TTGGCOTCAAGCAACIXJAGTACGGAGTTCCACXjTATCGTA^ 
C<j< ITGQ r X3ACTiTCCTTTAClX3XOTAAflCACACTT^ 
TOCCA^ C^TtCTOAAQATGACnTCCa 
TAJ CGA(XnTOGTACGGATATCCTroAAGAAGACATC^ 
CXT GAAj aATTUATTGAAGCAGTTGCTGAAACTOACGAAGAATO 
GAj ATA lCTAACOAAOAATTCaAAGCTCGTATCCGTAAAGCGACTATCAAOT 
TCI GTGC TTCA<XXHTCAAAAACAAAGGTGTTCAATTGATC 
™'CmACATTCCAOCAATCAAAGGTATT^ 

Aka LGCCATrrGCAGCTCTTGCCTTCAAGATCA^ . 
GTTT llGTCAGQTGTlxnTCAATCAGGTTiCATACGTATTGAAT . 

atccttkaaatgcacgctaacagwgtcaagaa 
ttgg{Htoaaagatacta^^ctggtoact^ 

ntCAAil CAAOwTTOCAGAACCAGTrATOCAATTGATXKiTTCA 
GGGf] ATCGCXXOTGAAAAATTGGCTX3AAGAAGATCCAAC 
h GAA> CAGTTATCTXJAGGTATGGGTGAACTT^ 
CAW AOH GAAqCGAACGTAGdTGCrcCTCAAGTATCTTACCGTGAAAGAT^^ 
CCfC QOA1 TXTFtsAAACGTGAGTCTGGTGGTAAAGGTCAATTCGGTQATG 
AOC AAG-/ AGGmAAGGATTCGAATTCOAAAACGC^TCOTCGGTGG 
AGC QGTT QAAwju^GGTTlXMTAGAATCTA 
AAAaGTAAQC#TATOA!IGGtTCATATCACGAtGTCQACTCA 
act rrcc* 2TTAAAGAAGCTGC1»AAtcAG^CaACCAGCTaTCCT^^ 
ACT }TTC 2AGAAQAAAACCTTCKJTGATGTrATGGGTCACOTAACTGCTCGTCQ 
TOO \AGC ACACGGTAACAGCCAAATCGTTCGTGC^ 
AGT rCIT< »TTC(TXK3ATX?rCAAGGACGTGOTACATTCATGATW 
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GA 



* WjMrCA<tTA(^^ 

• i| i 

SLCFfesfy "homologue of SEQ. ID NO. 93" 
AT }CCA!AATfACMTATrCCATTTrCACCGCCTQATATC 
CC "TQC 3TTC^TTGGATCACAACAGGTCCTAAAACAAAAG 
ACACAC ACCTMGACTGTTTQTCTC^ 

TG }GAC ctggtgatoaagtcatcgttccagccatgacct 

CC 3AGC AAGOCCTGTCATGGTGGATATCCAAGGAOATACGTTTGAQATGGACT 
CC TATC V.CTGAQAAAACTAAGGTGATTATCCCAGTAGAGCTCGCAGGGATTG 
TO (TCQ JlGTCGTGGACUAAAMGGTGACTTCmACCGCTTC 
TA ITGT< JATTOTCTCTGATAGTGCCCACGCTTTGGOATCT 
CP JAiTT TAOTCOTCTCATTC^ 

AA ^GCC \ATC|^GTOATTGATGACGAAGAGATGTACAAGGAATTC 
^ CTj iAQG ^TGCTCTTGCCAAGATGCAACTGGGGTCATGGGAATACGATATCGTT 

CA \CaT 3ACCGATATCATGGCTTCACTTGGTTTGGTACAATTGGACC 
^ GT VAGGACATtGTGGACCGCTATGATAGTGGTTTrGCAG 
m IAAC rGTCGAATCTTCACGCCACCTCTACATCA 

cr< :\ra .tcc^agaattggctaaagcaggaattgcaagtaatgt^ 

CA 3CCT> LTAAGAATCrrGGATtTOATAtGACGAACTATCCT 
AC< XntX CTCTTCATACrAAATTAAGCGATGAAOAAGTAQACTATATC^ 



CT^AAAMGTbCtAACTrr^ 

2 CFi>* . "homologue of SEQ. ID NO .95" 
AT( ^TTrtATACTTATTTGCGTXKJATTAGTTGTAinrO 
AA' ACT( jATAAAATTCtn*AATGAAGATGAAAATTATATTTTAGTTGC 
CTC TTT> TATGGCCTTTGCGACCAAGCCAAAA 
CCC tVATt ITTtGOTTTCTGGATTCGTATGTGTGGCGCCTTTCCC^ 
CQJ TCAJ ATATCCTATCAACGTTCTCAAAAAAAGTGAGCCTCTCTCTC 
Cj^< riiCAJ ACGATGTCAAGGGGGGCGCAGCACTC 
ACC TAO .CCGGTCCCATGACTTTGAAGGGCTTGATTAG 
Y£ GA> XCtij TATCTCAGATATCAAGAAAATGAATCUTGAAGGCATtG 
CAC lAA*^ XX^VUCGTCTGGACGAAGAAACOAAA 
G& TTA3 CCG^TCCCTGCCCTCATCCTIS^ 
CA> .GCP1 CATCTOGAACCCAGATAAGAAAAGAGAAGAACrnX^ 



l! 



2c*dl0i Vhoraologue of SEQ. ID NO. 97" 
ATC ACC^CGAAXTTWACATTTTGAAAA 
CAC CtCC rTTCACAQAAGAAQAATTGGAATCTATC^ 
f AC AGAT ATC^TCtOCOGTTOflCTCATCro 
CXA AAGfC IAATT^CCTCCAACX3TGAAACT 
GTI M/i AATOGACAACCAGTXXSCCTACTTCAAA^ 
AQ1 TGOl rACAj^aATGGTTTTCTCT^ 
G<W TBTC nXJAAAGCTATGATATGGAAGCTCTTCrrCA^ 
TAG aTa*I IXXnXjTCTATTCTCATQAAGTTTACXL^CATCG 
A TOA TUT xTaAT^GTTGAGGGAATX^AATG^^ 

TCT rrtii nTTTpCAltTATGTAGATGCTGGAGTCiGATC 
AAA AtGfc TCAGI^AGGCCAAAACGACCCroA 

aac ixscw AGanrrrcc^ 

AAjC CAQ> AAT^TO(^AAGTGATTC1TCATAAAA 
AAAGCTtGAGCj(l^ 

va 2*CF£0i "homologue of SEQ. ID NO. 98" 
^ ATG jAAAiTrrCArrATTAACAGATGm 
^ TCAjuAG^GCTflG^ 
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3 Cf€ /o2- 



T 



TA GTGA AATGGCGGTCACAGACCTGGGTGTAGCTTGGGTO 

OG TGW TGGTffCGCCCATTACCTAGAAATTGAAAATC 

AC kGM GCAtGGGAACTACTTTGGAAGTCCTTGCTAmTTO^ 

GA TTCG CGTATCGGCItGATTCGTGG AGAAGAATACCATC^GTTGACGAGCGATGATTCCTTGGT^ 
AA ITG£ rCAAOGCTGGTCAATTGACACCAGAAGAGGCAGAAdCTCATCCGCAAA^ 
AG rcrA ITGGCCAAAAAG ATGAAATTCAGCCTGATTTTC 
CT rGCT lAATAQTGACGGCTTGACCAACATGATTrCAQGCAGTGAGATTC 
AT rcpT TAGC^OVTAAAACGGAGACACTTGTTCGTTITGCT 
CGPTTG XGT^TTTCTATGAACXjAGGA 

CFfclOJl'LhomoXoque. of SEQ. ID NO. 99" 

at}acgKtacagatgaaoaatacaggtaaacgaattgatctgatagccaatagaaaaccg<^ 
AH 3 gtt ptgtAtga attgcgagatcgtttgaag agaaatc 
1T< JTCA' ttccattggcggggaiggtatgctcttgtcggccttt 

CT X?GG rTTATCOGTCTIX^TACrGGACArrTGGGCrrCTATAGA 

GO rAGT jACTAATTTGCAACTAGATACTGGGGGAAGGGTTTCTTACCCTGTTCT 

Tfl j AAA VTGGTG AAGTTAAOATTTTCAGAGCACTCAACG^ 

Gt3 rCjG(fc \GATATTGTAATAAATGGTGTIX£CTTrGAA 

CGI 3Afct< /GTAGTACTGCCTATAACAAGTCnXTITGGCGGTGC 

AF WAQ 3GAGATTGCCAGCCTTAATAATCGTGTCTATXX5AAGATTGGG 

AG 3ATAVGATTGAACTTArrcCAACAAGAAACGATTATCATA 

TfC JCGTV ATaITGAGCGTATTGAGTaTCAAATCGaCCATCATAaGATC 

TA< ICAQ' TTCTGGAACCGTGTTAAGGATGCCITrATCGGTGAGGltK5ATGAACT 

CACCACX5A - . 



a cm 



fcCFfe 



AW 

ACa< 



TKKC 



^homologpie dcf- SEQ. ID NO*. 100" 
lAGAAArTAAATTlTCATCAGATGCCCGTTC^ 
j^AAAGTAACXTTGOGACCAAAAGGTCGCAA 
GAtTAQ IAATGAGG GTCTGACCATTGCCAAAG AAATCG AATTGGAAGACCATTTf^ . 

< nATCAGAAGTAGCTTCTAAAACGAATOATATCGCAGGTC 
„ „ r _ -/ AGCTATCOTCCGTGAAGGAATT^UAAACGTCAC^ 
T<Kf GATT GAAACAGCAGTTCXX&CAGCAGTCGAA 
.Gtf rATCGCTCAAOTTGCAGCCGTATCrrCTC 
ATCGA4- lAAOflTaGCAAAGACGGTGTCATCACCAtCGAAGAGTCA 
GTQQTKt fAAG6AATGCAGTrrGACCOrGGTtA(XTTTW 
TG 1 / CCTTGAAAATCCOf ACATTTI^^ 
*G QAAAGCATTCTCCAAAGCAATCGTCCACTCTTOA 
iA CtC^GTTITGAACAAGATTGGTGGA^ 
TQ4CCG1 CGCAAAGCCATGCTTXjAAGATATCGCCATCTTAAC^ 

JTtC AGTTpAAAGAtXXXjACAAlTGAAGCTCTTGGTCAAGCAGCGAGAGTCACOT 
iCQi rrTATTGTAGAAGGTGCAGGAAATCCTGAAGCGATTTCTC 
AAATd© AACTaCAACTTXHXJAATTTGACCGTGAAAAATTGC 
TGTlAGCC GTTATTAAGQTTGGAGOXiCAACTaAAA 

CTf \ACGCTACTCGTGCAGCTX5T1XJAAGMGGTAT^ 
GTdATTf (^GGltJTTGCTACCTIXiGAArraACAGGAGA . 
CTT rGGA AGAACCCGlTCX3TCAAATTGCTCACAATG^ 
AA/ AAK ICTGA(KTIXK5tATAGG Am 
ATCf*T]r<5[\TC^GTTAM^ 

G^QTCGTAGO^TAAACCAGAACCAGTAOCCC^ 
GATGATGCTCGAGCACCACCACCaCCACCACTGA 



C1TTTG 
CTICCAA 



JAACA[Qi«Al 
TGGOGGf 



OS ' "homologue of SEQ, : ID NO* 101"' 
ATGkTTAAGATTG/^ 

GTC KVtA :CACrACAACTACAGGAMQTTATTITTX}ATAGCATCAGCT 
AGA CACT JH I \V['l I GCAAAAXjGCGCTGCCITTAAAaMGAATACCTTGTTTCTGCT 
TAGj^jJTATtjfAGCTGAAMCGACTACGM 
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CPS Aon*-*f*9 u < Sf~- fb Aid; /o/ 

AG£cAtIjaGTTTGATTGCCATGGAGTTCTATGGTAATCCACAA 
0G'ACTMGOOTAAaACAACAGCAACCTATlTC(KXTATAACATOT 
^ It* TGT( CACCiATGAAGACMCTCTTGATGGCGAGACTITCmAAQTCAGCGTTGACAACCCCTGAGAG 
5 TA' TGA( XnrmGAGATXJATGAATCAGGCTGTGCAAAATGACCGTACCGACCTCATCATG 
. < AG CAA SCCTATCTAOTCCATtGAGTCTATGGACTGAC^^ 
\V>, TO; iCC.4' *ATOKiCCCGATTOAACAC£CTAGCTTTO^ 

AA AGO :gaggagtcatcattaacagtgacatggaccacttctcagtcttgaaagaac^g^ 

^ AAl >AC<? \^roAshrrCTATGGTAGCCAATTTGATAACCAAATCGAGAATrcCAAAGCCTTTAG 
N AC(iOGT ^AACtCGCnXKSAGATTATGATATOCAACTCATTGOCAACTTX^VAGCAAGAAAA 

CTCiGAC TGCTTOTCTCCGTCTCGOAGCAAGTCTTOAGGACATCAAAAAAGGGATCGCTGCAACO^ 

TOO" 'CGTAfGGAAGTCCTCACTCAGAAAAATGGAGCCAAGGTCrrCATCOACTATGCCCACAATO 
GA' V*T( rTGAAAAAACTCATCAATGTGGTrOAAACTCATCAAACCGGAA^ 
Cfu iCAG }AAACAAGGGAGAAAGTCGTCGTAAGGACTITGGCCrrcCTCCTCAATCAACAC^ 
AAl iTCT TCTGACTGCTGATGACCCTAACTATGAAGACCXIaATGGCCATTGCAGATGAAATTAGTAGCTa 
C\: CM" ^TC^TTGAAAAGATTGOIGATCGCCAAGAAGC^^ 
TO CGA lTTAGATGCAGTTATTATTGCGGGTaaGGGAGCCGATTGTTACCAAATX^TCC 
GA, lipC 'ACCCAGGAGATACAGCCGTCGCAGAAAATTATrrACTCGAGCACCACCACCACCACCACT^ 

i"homologu£ -of-SEQ. ID NO. 102" 
A1X IAt« lAAATCCKlCAAGATTTTTGCCGGACGCTATCG 

CQ( IATG' xtacctagccaaaqacttaatcttagatggggaagaagtggcagtgaaggt^ 

TCC TOAI AKXlTTCGGArAACAGATATTGGrGAGGAAGACGQTX^AACAGTATCTrGGAA^^ 
G^GGACITAGACCTCAAACGCTAtATCMGGAACATTATCCrc 

A A/ TAT( CpTtGACACCAGATGGGACTGCCAAGGTCACAGACTITQGGATTGCTGT^ 
AC; AjtalTGAC^GACTAACTCOATGTre^ 

^ AGC CTTC TATTGGTOGCAGCTrcrcTTATrroGATACTATCCAGAACt 

GA/ GAU , G AGGCTAGrGAAAAGGTG GAA GAAG G<jCGG ATTaTCCGIV\CAGATC 
GTCC3GCCGGAAAimCTGATOTTATCaCGGAA 

^GAWICTAATATAGAAGTTOTA^^ . 

G^CAA/GTC^AOAOCAGGTGAAAAGGTAGAayrAAATAAG^ 
CCX WiA^CAAC^GCTACTCCTGC^ 

fi,CFSl(^ % ."horti9logue 6f SEQ. lb NO. 103" 
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GA KiAC CAGTTTOTCCTTAAGTCAACKjGATCAAACXrrT^ 
OT \AG1 ITTArCCTTCGACTGGTTCGACTGGCT^ 
AC -GAT^TTGAGGCTGCTGaAAGTCCTTTATTMCAGATTTTCCACATaaa 
GG ^GATGTGACCCTMGTTATGGTAAGC^TGTCATCACTCATOAm 
CA 3GTC rTGCTGGCCTACGCATGTCTAGCriTrcTCAAAGGTC 
CA \CTT XHXjAGAAGGACTTGGTTACATTTCTAGAAGAAAATC^ 
AA iCCTTGTTACCAGAAttCmGGCCGAATnTTTQTACAAQ 
T& \AA A 3GAACGAGAACAACTTGTC<^GTCCATTAA^ 
Ct TGCi lAAGTCCTTTGTTACCAAGGGTGGAGTCAGT^ 
^ AA XifTG jTACCTGGCCTCCACTTTGCAGGCGAAGTTATGGATATCAATGCCCA 

Tpi ictT< tgcx:ctctgtaccggctgggtggcgggaagtctgcattatgatctcgagca^ 

&Cimi<tt' ,, homologue of SEQ. ID NO..; 104" 
AT( SCTGi^TGGGAAGAGTTGCCTGTGGAAATGA^ 
Cp JVAA JGAAPGGrTCGCTGATTtTCAAGCGTrocnX}^ 
AC' TCTC CCATCTTTCTGATXnmVGCAT^ 
$ AfeAqrarGTGACCCAGTACAACCGT^ 
x AAAAAAC5GAAGTCTXK1TGAOT 

TC< OTTI GGACGMCnXjCCTCAATTGGTGAATGTCCTTAAAGGTC 
v-*\ GA. LGJG :CACGmTACAOAGCAGTATAGCCCTGAAATX3ATGGCAACCTrcCT 
GC; X1*TCC AGCCAGCATCAaCTACAAGGATGAGGaCaCCATCATCAGTCAAATGA 
CA< )TTGj LTCAGGCCTATGTGGAGCATGTTClTCGTGAAAAGATGCGCrATAACCT 

GT1 TAG1 TTCTTTGGGG ACATGAAA ATCATG . 
ACCACCACCACTGa 

>l ■ I . ■ ' 

2.CFI1W "homojogue of SEQ. ID NO. 1.05"- • 
ATC IACTAgTCCUcTATTAGAATC^^^ 
OTl COW ACGQ^TCGAAACTGCTTGTrc^ 
CT1 CCAC CCnmGATAGACCTCOrmnCOTGlTCAAOCtA^ 
^ CTC AGO .TTTAAAAGCAGGTTGGACCATTGAACGCITAACGCTC^ 
W ACT Cm GAAATC^CTTCATTTGACACTCXrrCAGCTG 

ACITCTCCGATCAAAAATCTXKXCGTrTTATCAATXSGACTGC^ 



CO« GCA< £ACCACCACCACCACTQA 



V 



O-CFElli 'homologue of SEQ. ID NO. 107' 
^ A jrM^^ T OTAAC» A Tr A TTO^^ 

TTC CCAA CAAATCAGGAAAXGGaAAAATCGCTTTCTrACAATTGCGTC^^ 
TOT 0AC1 mAj^Aa^AACmOTCGAAA^TTOCTGAA^ 
AA/ CGCT TX^GCCAAGAAACGTCTGTmTGTGACAGGTATTOTC^ 
OCT ATG* GTI^GACATCACAGAMTCaAAG^^ 
AAC ACGC iAACAGACrriTrcATGGATAACCGTGACTTGTGGCTA 
GC* AAK CGTAACGCTaTTATCTATGTAACTTATQAGTTCTTTGAG^ 
GOt CAAT lCrntAC<UAATG(XXK^GAAGATrCT 
AGC CTAC pPGAGCCAATCAGGTCAGCTITACCTAaAAGCAGGGG 



ATp 
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acPflte w n hoznologue of SEIQ. ID NO. 10.8" 
AT< rTCT( ?AAAAATTAGTAGAAATCAAAGATTTAGAAATTTCC^ 
CG< jTTA lAAATQCTAACTIUmATCAAC 

GA LAAC \ACTATrGGTCGTGCrATCATOKjTCTAMTGATACAAG 
CPu iAXQ VTT^TGGTAAGAAATCGCGTGAACAAGCTGCXjGAATTGATTCXjTCGAA 
ITCCraCAAGTTTGAATGAACGTO^ 

TTA>^GATGAAGAAGAACGTAAAGAGAAAGTTCAAAGTATTATCCGTC 
CT(i\CClCnTGACTCGTTACXX 
TT< cnaTGCAACEAGACTTTGTTATTC 
I ^ AG' CTTC AACTTGCTCAAAAAATTCCAAAAAGAGCTCGGCTTGACCT 

b£ c6( irra tcgctttatttcagatcgtatcgcagttatttacaagggto 

GAj iGAA rTGTTTAACAATCCAATTGACGCATATACTCAAGCCTTGC^ 

M 1 XZTTC GAAC5DTAAGAAGGTCTTGA AGGTTTACGACCCAAGTCAACACGaCTATGAGACTGATAAGCC 

At ( TATC GTAG AAATCCGTCCAGGTCACTATGTTTGGGCGAACCAAGCT^^ 

GG JcTAj LACCtGG AGCACCACCACCACCACCACTGA 

i 

^ Crtl^ .."homoloaue of SEa. ID -NO, 109" 
ATX AAd^TAAOTTGGAATGGATTTTCTA 




OC(ICTC( TTAGTCCnXJAGAGACAAGCTAGfCTGGAGAAGGATGAACAGATGAGTO . 
Cki ICTO; XTOAGAATGTGGTGGGAACTTTra 

TC/ GGA> tTACACCGTTCCCTATGTOACAGAAGAACCCTGTGTGGITGCGGCG 
atc Ato AGCGTGCAGGTGGTTITACTGCACAAGTCCATCAGCGCCAGATO 
ATX AAtf TGCl|j^TCCTAAACrAGCGCAAGAGAAGAT^ 
CO ATC> AGCCTATCCTTCTA1X3GTTAAACXJTCGGGGTGG 
A<i( fCGA LC^6ACTTIX7rCGTTGf^ 

AA< ACQ .TGCTGGAAGCCITGAAACCAGTCnTAGAAGAACT^ 
C7C TCCA ACTAtGCGACTGArTClrCTGGTGACTGCAAGCTOT 
AAJ .GGA' X^dkiAOlAGAGATraCGaAGAAAATTGCGTre 
CCC AGC1 GQTACTCATAATAAAGGAATTTTTAATGGTATro 
G0( GTGC CATCGAAGCTGGGGCOCATGCCTTTCCCAGTCGAGATC 
OAC GCTT GACXJTTGAAAGaGAAGAATTGGTCGGTGAGATGACCCTGCCCAT^ 
TOC dcj \TCGGCCTCAACCC^CGTGTAGCTXrrCAGTCATGATCTACT 
TAC CCC> GAT^TCGTGTCCATCXjGTCttGCTCAAAATlTrc 
ATC CAGC AAGGCCACATGAAACTACAGGCCAAATCC^ 
TTG CTCC rCTAGTAGAGCGCCnX^TCTCA^ 

AA/ TTiA A^TC^GCGGCCGCACTCGAGCACCACCVCCACCACCACXGA 

2CFE114 "homologue ; of SEQ, ID -NO . 1 10" - 

ATC CGA/jTTACklt^^ 

AAC TCG/ TGAATTTTTAGATA'rtGTGGlTCGTC^ 
TXT 3CGT VTTAAGAGrTTAGAAGAGCGTTTGTCTTACTrt 
TAT K} AT rGCTOAGGATACAGGTOAGAGAGTGAAACAGGCGGCG 

ATC AAOC agaggaagaxgcgcaacgcttgttxksaagaagctaaatataagg 

AAC CAA( TGATAATOCTAAGAAAGTCGCnXfTTOAAACAOAAGAATTO 
ACC \ACC TCyrcAAATCTTACAATTOAGAGTCAGTTGGCT 
CGT XAA CAGCTACTTATCnTCAAACCAGTGATOAAGCCT 
AAC COAT rcC\pCTCCAATTGAAOAAGAACCAATTC 
AGA \TTA CAAGGIXXJTATTOAGGTAGCCGATAAAGAATTGTCTG^ 

AGT jQAA ACTcjCAACTCCTGTAGTGAGTXX^ . 
TGtp$|\GAAp^ 

homologue of SEQ. ID. NO. 111" 

ItgtaatggAcaactaattta^ 
ttctaccifaagogaaacklaggggagattagaggcgctagaagaaaga 
^gta^tgatgaagtagaagctgtaaaaaatatgcacttgattggac 
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' jets "S- 
(EfclGAATGGXATCAAAAATGGGTCGATTTAT^ 
AAi jCAG AAGGCTATAACCATTCATTTCGTTTTCTCAAGGCCA 
AA rTAv nrTCATTGAAGAAGATATTGCGGCAATTCGCAATG^TTTC 

AA VAAT ^GTGGTCGTGTTCTTCATGCmGGAmATTTGAGGAACTTCAGCATAGAGTTGCTGAAAATTC 
AGAACA 3TATGGTCAAGCCTTGGATCAAATTGA AAMCAATTAGAAAATATCCAATCTGAATTrrCA(iAA 
TT GTAJ lCCTTGAATTCATCGGGTGACCCTGTGGAAGCCGCAGTGATTTTGGATaATaCAGAAAATCaCA 

TT TGCS( 'ctta'Agtcatattgtggatcgtgttcgagccttgg 

CAi ITTAI 'AGGAmGQAAGCCGGmTCGTAAACTAATrGATGCTAATTATCATrrTGTTOAAACGOATAT 
TGj WW 3CGTTTCCACTTGCtTTATGAAGCATTGAAGAAAAACCAAGAGAATATTCGTCAGTTGGAAT^ 
OA TAAT jCCGaATATGAGAATGGACAGGCACAAGAGGAAATCAATGCCTTOTA^ 
AA, KTfQ XTQCTjCAGAAAOTAOTGOAAAATCTACTTGCAACTCTO 
GAVTAATACTTrtl 



.51 



CAT 
CCT 



GAATAAFACnfa^ 
AG< £ATi}TTCGTCGTATTCAGACAGM 
MOAAG/^CQAACCCAAGCTTATXX^GTTCTTGAAGAAAATC 
At) lTKL lAGATGAGCAAATTTCAGTTAGTGAGCGCCTGAGaCAAATTGAG 

^ GT( :aaA vggccaatgtttatgtcaatcgtctc^^ 
KT AG< jtat ^ccacaaactttcttoaagttattctttacggcaagcaataataccg 

TAC IA!Ad u\AAAATOATTAACATTGAATCnX3TTACCCGAGTTCTTO 
Ttl kGAJ ACGGAAACrtATAATATTCTACAATATGCMCTTTGA 

CG( TAfC SGCTflATITGATGAAGGCATTC 
TTC ATT/ TCACGCTTGATTTGACAAGATTTCTCAAGCATTG^ 
CTI TGTT iVCCrt>TATG AGAAAAGACGTGAAA(X1ATTC 
CA^CACrtGA || 

2.CFA116 "homologue-af-..SEQ. IDNO.112." 
ATC C^TATdTTATAAATGGTTAAAAGAATTGGTGGACATTGATGTG 
AAi iVGTi AACTACAGGAATCGAGGTAGAGGGtGTCGAATCACCAGCTGCTGGTCT 
CGC ITGA< « TCTTGtCTTGOTAAGATGTGCCAGAGACTCACXrrc 
GAJ lGa0 xmUQATCXJTTTQTG^ 

GA( ICTC< iTATOGCTOATAACTACAAAATCAAAAAAGGAAAAATCCG 
TCI GTTC ^CTTQGTOAATTQGGLAATTTCrGACrCAGTTGTC 
TKS CQT€ AAGATGCCGTGCCAGGTGAGGAAGTCTTTTGTTACCT 

m zcaj l^c^ccAaaccgtgcagatgccctttctatgtc 

TG/ CAACrGC^OTCAACTTTAAAGAATTTACTUr 
GTC ACK^ TTGApACAOAGAAGGCGCCTTACTATGCAGCT 
GTC CA(A ATGqi^AAAAarrrCTC 
<N CA/ CTAjC ATOrtGCTCTATTTTGGTXLVACCAATG 

TCC GTG1 3CGTbAAGCGCGTG(7rcGTGAAAAATTGGT^ 
GA/ It5A( CTAGTXIATCACTGTCGCAGACAAGCClAGTAG 
GA/ ATCJl CTGAAAAATCTAGTCGTGTTGTC 
AAC TGG1 CGCCTGAACCTTCGTTCTGAGTCATCTTC^ 
AA1GAAC tCCCTTCATGCGGCAGCTAGCCTGATTGCGGAA 
OTT rCAC CXKKn^GAGCTTGATACiTCAGATCT 
CCfC MCI GAGCTGTCTTATGCTGATGTAGAAGACGTCTTC 
CAC ACAC CTTf ACAGTGAGAGTCCQlCGTXXSTaSTIG 
AAI TGCT JGrAtCTAlXKiTTATGACCGCTTO 
TTG \CA€3 CCACACAAAAACTCCGCCQTCAAGTTCGTACTATTXKTO 
TC A :CTA rACTGTAACAACTCCTGAAAAAGCAGTTCA 
3TGd CGAATGACIGTGC^TCGTrCAGTCCT^ 
KCAA OTTGpCTMTAAGAATAAAAACTTGG 
IL^GMGAACTTCCAAATGAAATCAAC 
rGCAAACAGGAGCAGTTGCAGTTGATTT^ 
p¥nx*< JGACTKXJAAGTAACCTATACAGCMC^ 
gtc vrrr ^ACTGGGTGACCAAGTTCTTGGTTTCCTTGGCCAAGTGCA 
TAT X:CA( jAAAteTATGTGGCTGAGCTTAACCTTTCA 

TTT( iTAG, LAATtWCCAAATTCCCGGCAGTCAGCCGTGAC^ . 



AAFCCAL 
AA!A|GATl|r< 
TC( 
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\ < ATjCAAC AAGTTGTAGATGCTATCCAAGCTGCXX>GCGTGAAACG7TTtM 
,<v <\ CT rCXC ^GGTGAC3AAATTGGGACTTGGTATGAAGTCAATGGCTTATAGCTraACCTTCCAAA^ 
fcf^L, OA TAGC TTAACGGACGAAGAAGTUSCACGCTATATGCiAAAA^^ 
• AA TGCf GAAGTGCGTCTCGAGCACCACCACCACCACCACTGA 

• 2. CFBUTI "hioriiologue of SEQ. lb NO. 113" 

AT OTTA 3AAAACG>ATATTAAAAAAGTCCrcGTTTCAC^CGATOAAAT^ 
GC TGCI SAATTAACTAAAGACTATGCAGGAAAAAATCCAATCTTAGTrc 

</T nTA •GCKTOAATTGGTCUAACATATTGATAC^CATATTGAAATGaACTTt^ - 
' Q CA rCOT SOA^Gi^AGTAGTqaTOTTATCAATATTAAACAAGATGTGACT^ 
V rGTT rTATfTOTAGAACLiTATCATTGATACAGGTCVVAACm 

AA AGAC AAG<^GCTrCTGTTAAAACTGCAACCTT^ 
£\. AG GCAC ACTAjTACCTGOTTTACTATCCCAAATGAGTI^ 
Ww IT \TCG AATGTTCCTrATATTGGAGTATTGAAAGAGGAAGTC^ 

CA2CACPGA 
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Davison, Daniel B. 
Bruccoleri, Robert E. 
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10 <120> NOVEL BACTERIAL GENES AND PROTEINS THAT ARE ESSENTIAL 
FOR CELL VIABILITY AND THEIR USES 

<130> 30436. 44USU1 

15 <140> Not yet known 
<141> 2000-12-30 

<160> 226 

20 <170> Patentln Ver. 2.0 
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35 



40 



45 



50 



55 



<210> 1 
<211> 708 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 1 

atgatttatg caggaattct tgccggtgga 
ccaaaacaat ttttagagct aggtgatcga 
gtcttggaac caagtattga aaaaattgta 
gcagaagatc ttgtagataa atatcttcct 
ggtggtgctg accgcaatac aagtattgag 
ccgcttactc cagaggatat cgttgttacc 
cgcatgattc aagacagtat caaacttgct 
gaagcagtgg atactatcgt tgaaagtacc 
cgtgctcacc tctatcaggg acaaacacct 
ctttatggat ctctttctga tgaagagaag 
gtgatcaaag gaaaagatgt agccttggcc 
accgtaacag atttgaagat tgcaaaaagt 



actggcacac 
cctattttga 
gttggggttc 
cttcataagg 
aacatcattg 
cacgattctg 
caaaatcatg 
aatggtcaat 
caaacattcc 
gaaatcttga 
aaaggcgaat 
atgattgaga 



gcatgggaat 
ttcatacaat 
atggagactg 
aacgtattat 
aagccattga 
ttcgtccatt 
acgcagtgga 
tcattacagg 
gttgcaagga 
cagatgcatg 
actcaaatct 
aagactag 



cagtaacttg 60 
tgaaaaattt 120 
gg.ttttacat 180 
cattacaaag 240 
t'gcttatcgc 300 
tattacgctt 360 
tacagtagta 420 
tattccaaat 480 
cttcatggac 540 
taaaatcttt 600 
gaagattaca 660 
708 



<210> 
<211> 
<212> 



2 

558 
DNA 



<213> Streptococcus pneumoniae 
<400> 2 

atggctaacg taattattga aaaagctaaa 
gctcgtgaat ttggtggtat ccgtgctggt 
catgtagaat actatggagt cgaaactcct 
gaagcgcgtg ttttgttggt aacaccattt 
gccttgaacg cttctgatct tggtatcaca 
gttatcccag ctcttacaga agaaactcgt 
ggcgaaaatg ctaaagtggc tgtccgcaat 
aaacaagaaa aagcacaaga aatcactgaa 
caaaaagtaa cagacgatgc tgttaaacac 
gaacttttgg aagtctaa 



gagagaatga 
cgtgccaatg 
cttaaccaaa 
gacaagtctt 
ccggdtaatg 
cgtgaccttg 
atccgtcgcg 
gacgaattga 
atcgacgaca 



cccagtctca 
caagcttgct 
tcgcttcaat 
cattgaaaga 
acggttctgt 
ctaaagaagt 
atgctatgga 
agactcttga 
tgactgctaa 



ccaatcactt 60 
tgaccgtgta 120 
tacgattcca 180 
catcgaacgt 240 
gattcgcttg 300 
gaagaaggtc 360 
cgaagctaag 420 
aaaagatatt 480 
caaagagaaa 540 
558 
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<210> 3 
<211> 1353 
<212> DNA 

<213> Streptococcus pneumoniae 

5 

<400> 3 

atgggtaaat attttgggac tgatggagtc cgtggagaag ctaacctaga actaacacca 60 
gaattagcct ttaaactagg acgttttgga ggctatgttc ttagtcaaca tgaaacggaa 120 
gcgccgaaag tctttgtagg acgtgacaca cgtatttcag gggaaatgct ggaatcggcc 180 
10 ttggtggcag gtctccttt'c agtagggatt cacgtataca aacttggtgt ccttgcaaca 240 
ccagcagtag cttacttggt tgaaactgaa ggagcaagtg ccggtgtcat gatttctgct 300 
agccacaacc cagcccttga taacggaatc aagttctttg gcggtgatgg cttcaaacta 360 
gatgatgaaa aagaagcaga aattgaagcc ttgctagatg ctgaggaaga cactcttcct 420 
cgtccaagtg cagaaggctt aggaattttg gtagattatc cagaaggctt gcgtaagtat 480 
15 gaaggatacc gtgtgtcaac tggaactcct cttgatggaa tgaaggttgc cttggataca 540 
gctaatggag cagcttctac cagtgcccgt caaatctttg cagaccttgg tgcccaattg 600 
acggttatcg gggaaacacc agacggtctt aacatcaacc ttaatgttgg ttcaacacat 660 
ccagaagccc ttcaagaagt ggtcaaagaa agtgggtcag ctattggttt ggcctttgat 720 
ggagacagtg accgcttgat tgctgttgat gagaatggtg acatcgtcga tggtgacaag 780 
20 attatgtaca tcatcggaaa atacctttct gaaaaaggac aattggctca aaatacaatt 840 
gtgacaactg ttatgt.ctaa ccttggtttc cacaaggcct tgaatcgcga aggtattaac 900 
aaggcagtta ctgcagttgg tgaccgctac gttgttgaag aaatgagaaa atcaggctac 960 
aaccttggtg gtgaacagtc tggtcacgtt atcttgatgg attacaatac cacaggtgat 1020 
ggtcaattat cagcagttca attgactaaa atcatgaagg aaactggtaa gagcttatca 1080 
25 gagttggcgg cagaagtaac gatttatcca caaaaattag ttaatatccg agtggaaaac 1140 
gtcatgaagg aaaaggccat ggaagtgcca gctatcaagg ccatcatcga gaagatggaa 1200 
gaagaaatgg cggggaacgg ccgtatcctt gttcgtccaa gtggaacaga gcccctcttg 1260 
cgtgttatgg cagaagcgcc tacaacagaa gaagtggact actatgttga taccatcaca 1320 
gatgtagttc gtgctgaaat tgggattgac taa 1353 



30 



35 



<210> 4 
<211> 705 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 4 

atgaaaaaaa tactaattgt agatgatgag aaaccaatct cggatattat caagtttaat 60 
atgaccaagg aaggttatga agttgtaact gcttttaatg gtcgtgaagc gctagagcaa 120 
tttgaagcag agcaaccaga tattattatt ctggatttga tgcttccaga aattgatggt 180 

40 ttagaagttg ctaagaccat tcgtaagaca agcagtgtgc ccattcttat gctttcagcc 240 
aaagatagtg aatttgataa ggttatcggt ttggaacttg gggcagatga ctatgtaacg 300 
aaacccttct ccaatcgtga gttgcaggcg cgtgttaaag ctcttctgcg tcgttctcaa 360 
cctatgccag tagatggtca ggaagcagat agtaaacctc aacctatcca aattggggat 420 
ttagaaattg ttccagacgc ctacgtggct aaaaaatatg gcgaagaact agacttaacc 480 

45 catcgtgaat ttgagctttt gtatcattta gcatcgcata caggtcaagt catcacgcgc 540 
gaacacttgc ttgagactgt ctggggttat gactattttg gtgatgtccg cacagttgat 600 
gtgactgtac gacgtctgcg tgagaagatt gaagatacgc ccagccgacc agagtatatc 660 
ttgacgcgcc gtggtgtagg gtattacatg agaaataatg cttga 705 

50 <210> 5 

<211> 1107 
<212> DNA 

<213> Streptococcus pneumoniae 
55 <4_00> 5 

atggaagaaa ttctctgtat tggttgtgga gcaaccattc agacgacaga taaagctggt 60 
cttggtttta ccccccagtc ggcacttgaa aaaggtttgg agactggcga agtctattgc 120 
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caacgctgtt 
gatttcctca 
gatatctttg 
gatgtcctct 
5 attagccagt 
ctaacttcag 
cgtaagggcc 
aatgctatta 
gggacaacct 

10 ccgggaatta 
gtcagtccta 
tttttaggtg 
ttctttgata 
gataagcacc 

15 aggctagtcc 
ggatggattc 
gtcgtcacac 



tccgtctccg 
agctcttgca 
attttaatgg 
tggtaggaaa 
ggctcatgaa 
cacaaaataa 
gcgatgtcta 
tccaagaaat 
tggacaaaat 
tccaccgcca 
aaaagga'aat 
gtttgggacg 
atgaactcaa 
tgggaactct 
agcatgtctt 
gtgtaacagg 
gaaaagcaat 



ccactacaat 
cgaggtggga 
atctgtcatc 
taaaaaagat 
acgtgcccat 
acatgccatt 
tgtggtcggt 
cacgggtgat 
agagattccg 
ccagatggct 
caagcctaag 
ctttgacttt 
actccatcgt 
tctgacacca 
taccattaaa 
cacagcaaaa 
tatttaa 



gaaatcacag 
gacagtgatg 
ccaggtttac 
atccttccta 
gaagaaggtc 
aaggaagtca 
gtgaccaacg 
cagaatgtca 
cttgacgacg 
cactacttga 
acctatcagc 
atagcaggag 
agcaagcttg 
ccaaatagca 
gataagacag 
gtcgccgtct 



atgtccagtt 
ctttagtggt 
cacgtttcgt 
agtcagttaa 
ttcgtccagt 
ttgacaagat 
ttggaaaatc 
tcactacttc 
gatcttatat 
cggccaaaaa 
ttaatcctga 
aaaagcaagg 
aaggagctag 
aggaaaaaga 
acctagtcat 
gggcaccaga 



gacgaacgat 
caatgtcatt 
ctcgggcaat 
gtctggtaag 
cgatgtggtc 
tgaacactac 
aactctaatc 
acgcttccca 
ttacgatacg 
cctcaagtat 
gcaaacccta 
atttactgct 
tgctttctac 
agatttccca 
ctcaggccta 
aggcgtcgcc 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1107 



<210> 6 
20 <211> 1461 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 6 

25 atgtatccag 
gtttactttg 
cagcctttta 
gaagacttgc 
gcgttcttgg 
. 30 ggggatttgg 
cagttggtcg 
gcagctcgta 
gctcaagaaa 
ggaaccagca 

35 catgccttgg 
cacaaaaatt 
gccattcagg 
tctggggata 
acagaggcta 

40 atgcaaaagg 
cagccggctc 
cgcaatacga 
gtgtggcgca 
ggtgtggata 

45 aagaagacgg 
atattagttt 
gacaagttgt 
gcgcgtgatg 
ggtgaaggag 

50 

<210> 7 
<211> 852 
<212> DNA 



atgatagttt 
accaagggat 
agaacggcta 
gtttttcaga 
attaccttcg 
tttttgctaa 
aaacggctct 
ttcgttcggt 
cggatgcggc 
acgtgcgtgc 
tacaggttta 
gtgtctttct 
tggcgcgtga 
ttgcctacat 
agatttatgc 
ccaagattga 
ttggggcggt 
ttaagctgtc 
ttaccagtcg 
ttagcgacat 
ttcgtaattt 
acaacttgcc 
gggatgagta 
tatggcaaga 
aagaagaatg 



gacattgcac 
tcacaataag 
tgcggttttt 
tagtgatata 
caatttcaag 
tgaaccgatt 
tttgaacatc 
tatcgaagat 
catctgggga 
gggtaagctc 
tggcaatgac 
tgtggatacc 
gctgggtgat 
ttctaagaaa 
ttctaatgat 
tgtctggggt 
ttacaagatt 
taataatgcg 
tgaaaaaggt 
gacagaaatc 
tgatgccgtt 
tagtttgact 
taagcgtgtg 
taagatggac 



acggacttgt 
aaggcggtct 
gcaggtttgg 
gcctatttgg 
ttggagttga 
gtgcaggtgg 
gtcaactacc 
gaacccttga 
acacgcgcag 
tttgacattc 
tatgaggctt 
tatgataccc 
cagattaact 
gtccgtcagc 
ttggacgaaa 
gtgggtacca 
gttgcaatcg 
gaaaaagtgt 
aagtcagaag 
aagatgttcc 
cctctcttgg 
gacattcagg 
ctcaatccgc 
ttgattgata 



accagatcaa 
ttgaggtgta 
aaagaattgt 
agtcgcttgg 
ccgttcgttc 
aaggacctct 
agaccttggt 
tggagtttgg 
cggtgattgg 
ctgttttggg 
tcaaggctta 
ttcgcatcgg 
ttatgggtgt 
aactggacga 
atactatcct 
agctgattac 
aagatgaaac 
cgacgccagg 
gtgattacat 
atccgaccta 
tggatatctt 
attatgcccg 
agcactatcc 
agatgcgcaa 



catgatgcag 
tttccgccaa 
gaactatctt 
ttatcatggg 
tgcccaagaa 
agcccaatgt 
ggcgacgaag 
gacacgtcgg 
tggcgccaat 
aacccatgcc 
cgctgcgacc 
tgtaccagct 
gcggattgac 
ggctggattt 
caatctcaag 
agcctatgac 
tggtcagatg 
taagaagcag 
cacttatgat 
tacatacatc 
caaagaagga 
taaagaattt 
agtggatttg 
ggaagccctt 



60 

120 

180 

240 

300 

360 

420 

4 80 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1461 
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<213> Streptococcus pneumoniae 
<400> 7 

atggctacta ttcaatggtt tcctggtcac atgtctaaag ctcgtcgaca ggtgcaggag 
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aatttaaaat ttgttgattt tgtgacgatt 
caaaatccta tgttgaccaa gattgttggt 
gccgacttgg ctgatccagc aatgaccaag 
atccagacgc tagctatcaa ctccaaagag 
5 gccaagaagc tcatggctga taagattgct 
accttgcgta ctatgattat cgggattcca 
ttggctggta aaaagattgc tgttgttgga 
tggcttaaaa ccaataaaga cctggaaatc 
tttgaggatg aaactgttgc acttaagttg 
10 cttcctatgg atgaggttac catttttggt 
aagctggctg aacgcttcaa acaaatgaaa 
gatatgaccc gcgccctcgg tttccgtgat 
aaggaagttc gtgatggcaa actcggtaac 
ggcaacgatt aa 

15 

<210> 8 
<211> 471 
<212> DNA 

<213> Streptococcus pneumoniae 

20 

<400> 8 

atgattaaca atgttgtact tgtagggcgt 
ccatcaaatg tagcagttgc gacttttact 
aatggtgaac gtgaggctga ttttatcaat 
25 cttgctaact gggctaaaaa aggctcactt 
agttacgata accagcaagg acaacgtgtc 
caaatgttgg aaagccgtag tgtgcgtgag 
actgcaaact attcagcacc tacaaattca 
tttggagcaa caaacccatt ggatatttca 

30 

<210> 9 
<211> 975 
<212> DNA 

<213> Streptococcus pneumoniae 

35 

<400> 9 

atgaaaacgc gtattacaga attattgaag 
gcctgggttg ctgatggtga tttggcaggg 
atcggtgggg gaaatgcccc gaaagaagtt 

40 ttgactgata aaccctttgg ggtcaacatc 
gtggatctcg ttattgaaga aggtgttaaa 
aagtatatgg aacgtttcca tgaagctggg 
gctttagcta aacgcatgga aaaaatcggt 
gctggggggc atatcggtaa attaacaacc 

45 atatctattc ctgttattgc tgcaggagga 
tttatgctag gtgcagaggc tgtacaggtg 
aatgcccatc caaactacaa ggagaaaatt 
tcagctcagc actttggtca tgctgttcgt 
gaactggctg aaaaagatgc ctttaagcag 

50 atgggagcag gtgccctagc caaagcagtt 
atggcaggtc aaatcgcagg gcttgtttct 
gatttgtatt acggagccgc taagaaaatt 
gtaagaaatg actaa 

55 <210> 10 
<211> 423 
<212> DNA 



ttagtagatg cacgcttgcc tctatctagt 120 
gataaaccaa aactcttgat tttaaacaag 180- 
gaatggcgtc agtattttga atcacaagga 24 0 
caagtgactg taaaagttgt aacagatgcg 300 
cgccagaaag aacgtgggat tcagattgaa 360 
aacgctggta aatcaactct gatgaaccgt 420 
aacaagccag gggtcacaaa aggtcaacaa 480 
ttggatacac cggggattct ctggcctaag 540 
gcattgactg gagctatcaa ggatcagttg 600 
atcaattatt tcaaagaaca ttatccagaa 660 
attgaagaag aaccgtctgt gattattatg 720 
gactatgacc gtttttacag tctcttcgtg 780 
tataccttag atacattgga agacctcgat 840 

852 



atgacacgtg acgctgagtt gcgttatacc 60 

cttgcagtaa accgtacatt taagagtcaa 120 

gtcgttatgt ggcgccaaca ggctgaaaat 180 

atcggggtga caggtcgtat ccagactcgt ■ 240 

tacgtgacag aggtcgtggc tgagaatttc 300 

ggccacacag gtggagctta ctctgcacca 360 

gtaccagact tttcacgtaa tgaaaatcca 420 

gatgatgatt taccattcta a 471 



attgattatc ctattttcca aggagggatg 60 
gctgtttcca aggctggagg attaggaatt 120 
gtcaaggcca atattgataa aatcaaatca 180 
atgctcttat ctccctttgt ggaagatatc 240 
gttgtcacaa caggagcagg aaatccaagc 300 
ataatcgtta ttcctgttgt tcctagtgtc 360 
gcagacgctg ttattgcaga aggaatggaa 420 
atgaccttgg tgcgacaggt agccacagct 480 
attgcggatg gtgaaggtgc tgcggctggc 540 
gggacacggt ttgtagttgc aaaagagtcg .600 
ttaaaagcaa gggatattga cactacgatt 660 
gctattaaaa atcagttgac tagagattt-t 720 
gaagatcctg atttagaaat ctttgaacaa 780 
gttcacggtg atgtggaggg tggctctgtc 840 
aaagaagaaa cagctgaaga' aatcctaaaa 900 
caagaagaag cctctcgctg gacaggagtt 960 

975 
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<213> Streptococcus pneumoniae 



<400> 10 

atgatcgata ttcaaggaat caaagaagcc cttccccacc gttatcctat gcttctagtg 60 

5 gaccgtgtct tggaggtgag cgaggatacc attgttgcta tcaaaaatgt gaccatcaac 120 

gagcctttct ttaacggcca ctttcctcaa tacccagtta tgccaggtgt tctgattatg 180 

gaagccttgg cgcaaactgc cggtgtgttg gagttatcaa aacctgaaaa taaaggaaaa 240 

ctggtctttt acgctggtat ggataaggtt aagttcaaga agcaagttgt accaggcgac 300 

caattggtta tgacagcgac ttttgtaaaa cgtcgtggca ccatagctgt ggttgaagca 360 

10 aaggctgaag tggatggcaa gcttgcagcc agtggtaccc ttacttttgc aattgggaac 420 

taa 423 

<210> 11 . 
<211> 1023 
15 <212> DNA 

<213> Streptococcus pneumoniae 



<400> 11 

atgattaatc aaatttatca actaactaaa 

20 gaggctattg accaagagaa tcatatcctt 
gcggatcagc gttactacca gggaaaacgt 
atggcaatga ttcacgagtc atgtggaatc 
gttggt'caaa aagttgtcat gattcccaat 
tatgaaaact acatgacagg gapccatttc 

25 gagtttgttt ctctccctaa agatcgtgtg 
gcagccatta cagagtttgt cagtgtgggc 
gctcatagca agcgggagcg gatccccgtt 
gccaatatta tcaactatac tttgccagaa 
gaaaagttgg aactcttctc atttgccaaa 

30 gagttggcct ttgaccatgc ttttgaatgt 
aatgacttga ttcgctacat tcgtcctcag 
tataaagtca atctcaatac tcgcgatgcc 
tctcgttctg gtcgcattga ttttgaaaat 
gccaatcgtc ttaaaaatat cctttatcta 

35 caccgtgtct ttgcaaccga tttaaacaca 
taa 

<210> 12 
<211> 1344 
40 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 12 

atgaacttaa aaactacttt gggccttctt 
45 cgtcttggac gtggaagtac gctcccaggg 
ttacaaaacc tagctaagaa ctacgagatt 
ctgacaactg ccctcactgt cggcatttta 
ccaagcggtg ccaacatgat tacagggatt 
aaaactggga aaaatattgc cgtcctcgaa 
50 gactatatcc agcctagtct ttttgtcatt 
ttcggtgaaa tctatactac ctataacatg 
gctactgttc tccttaacgg agacagtcca 
atagagtatt ttggttttga cttggaaaaa 
gaagggattc tctgtcctga ctgccaaggc 
55 aacttgggtg cctatatctg tgagggttgt 
ttgacaaaac tggttgagtt gaccaacaat 
tacggtatcc aaatcggcgg gctctataat 



cctaagttta tcaatgtcaa atatcaggaa 60 
atccgtccca actacatggc tgtctgtcat 120 
gatcccaaga ttttgaataa aaagcttcca 180 
gtcatttctg acccgagcgg aacctacgag 24 0 
cagtctccta tgcagagtga tgaagaattc 300 
ttgtctagtg gatttgatgg ctttatgaga 360 
gtggcttatg atgctattga agatacggtt 420 
atgcacgcta tgaatcgtct attgactctt 480 
attggagatg gaagtttagc ttttgtggtt 540 
gcagagattg tggttattgg tcgtcattgg 600 
gaatgctata ttacggataa tattcctgaa 660 
tgtggtggtg atggtactgg accagctatt 720 
ggaacaattc tcatgatggg agttagcgaa 780 
ttagaaaagg -gcttgctctt ggttgggtca 840 
gctatccaaa tgatgaaagt caagaaattt 900 
gaagaacctg taagagaaat taaagatatt 960 • 
gcctttaaaa cagtgtttaa gtgggaagta 1020 

1023 



gctggacgtt cttcccactt cgttttaagc 60 
aaagtcgccc ttcaatttga taaagatatt 120 
gtcgttgtca ctggaacaaa tggaaaaacc 180 
aaagaggtct atggtcaagt tctaaccaat 240 
gcaacaacct tcctaacagc caaatcttct 300 
attgacgaag ccagtctatc tcgtatctgt 360 
actaatatct tccgtgacca gatggaccgt 420 
atattggatg ccattcggaa agttccaact 480 
cttttctaca agccaactat tccaaaccct 540 
ggaccagccc aactggctca ctacaatacc 600 
atcctcaaat atgagcataa tacctatgca 660 
ggatgtaaac gtcctgatct cgactatcgt 720 
cgctctcgct ttgtcataga cggccaagaa 780 
atctataacg ccctagctgc tgtggccatc 840 
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10 



15 



gcccgtttcc tcggcgcaga ttcccaactc atcaaacagg gatttgacaa gagccgtgct 900 

gtctttggac gccaagaaac ctttcatatc ggtgacaagg aatgtaccct tgtcttgatt 960 

aaaaatccag tcggtgcaac ccaagctatc gaaatgatca aactagcacc ttatccattt 1020 

agcctatctg tcctccttaa tgccaactat gcagatggaa ttgacactag ctggatct.gg 1080 

gatgcagact ttgagcaaat cactgacatg gacattcctg aaatcaacgc tggcggtgtt 114 0 

cgtcattctg aaatcgctcg tcgcctccga gtgactggct atccagctga gaaaatcact 1200 

gaaacgagta atctggagca agttctcaag accattgaga atcaagactg caagcatgcc 1260 

tatattctgg caacttatac tgccatgctg gaatttcgtg aactgctggc tagtcgtcag 1320 

attgttagaa aggagatgaa ctaa 134 4 

<210> 13 
<211> 783 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 13 

atggtttata cttcactttc ctcaaaagat ggcaattacc cctatcagct caacattgcc 60 
.cacctctacg gaaatctcat gaatacctac ggggacaatg gaaacatcct catgctcaag 120 
tatgtggctg aaaaactggg agcccatgtg accgttgaca tcgtttctct ccatgatgac 180 

20 tttgatgaaa atcactacga catcgccttt ttcggtggtg gtcaagactt tgaacaaagt 240 
atcattgcag acgacctacc tgctaaaaaa gagagcattg acaactacat ccaaaacgac 300 
ggtgtagttc tggctatctg cggtggtttc caactattgg gtcaatatta tgttgaagct 360 
tcaggaaaac gtatcgaagg gctaggggtc atgggacact acacgctcaa ccagaccaat 420 
aaccgtttta tcggtgacat caagattcac aatgaagatt tcgatgaaac ctactatgga 480 

25 tttgaaaatc accaaggtcg taccttcctc tctgatgacc aaaaaccgct gggacaggtt 540 
gtctatggaa atggaaacaa cgaagaaaag gtcggtgaag gggttcatta taagaatgtc 600 
tttggttcct acttccacgg gcctatcctc tctcgtaatg ccaatctggc ttatcgccta 660 
gttactactg ccctcaagaa gaaatatggt caggacatcc aactccctgc ctatgaggac 720 
attctcagcc aagaaatcgc tgaagagtac agtgacgtca aaagcaaggc tgacttttct 780 

30 taa 783 



<210> 14 

<211> 276 

<212> DNA 

35 <213> Streptococcus pneumoniae 

<400> 14 

atggcaaaca aacaagattt gatcgctaaa gtagcagaag ctacagaatt aactaagaaa 60 

gactcagcag' cagcagttga agctgtattt gcagcagtag ctgactatct tgcagctggt 120 

40 gaaaaagttc agttgatcgg ttttagtaac tttgaagttc gtgagcgcgc agaacgtaaa 180 

ggtcgcaacc cacaaactgg taaagaaatg acaattgcag cttctaaagt accagcattc 240 

aaagctggta aagctcttaa agacgctgtt aaataa 276 

<210> 15 
45 <211> 840 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 15 

50 atgggaattg ctctagaaaa tgtgaatttt acatatcaag aaggtactcc cttagcttca 60 
gcagctttgt cggatgtttc tttgacgatt gaagatggct cttatacagc tttaattggg 120 
cacacaggta gtggtaaatc aactatttta caactcttaa atggtttatt ggtgccaagt 180 
caagggagtg tgagggtttt tgatacctta atcacctcga cttctaaaaa taaagatatt 240 
cgtcaaatta gaaaacaggt tggcttggta tttcagtttg ctgaaaatca gatttttgaa 300 

55 gaaacggttt tgaaggacgt tgcttttgga ccgcaaaatt ttggagtttc tgaagaagat 360 
gctgtgaaga ctgcgcgtga gaaactggct ctggttggaa ttgatgaatc actttttgat 4 20 
cgtagtccgt ttgagctgtc agggggacaa atgagacgtg ttgccattgc aggcatactt 480 
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gccatggagc cagctatatt agtcttagat 

agaaaagagt tgatgaccct gttcaaaaaa 

gtaacgcatt tgatggatga tgttgctgaa 

ggacgtttag taaagggggg caaaccaagt 

5 gaagttcagt tgggagtacc taaaattacg 

gtgtcattta aacgattacc ggttaagata 

<210> 16 
<211> 930 
10 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 16 

atggatattc aatttttagg aacgggggct 
15 agtctcgccc tgaaactttt ggatgagatt 
ggtacgcaaa atcgcattct ggaaaccaca 
attacccatc tgcatggaga ccacattttt 
tttcaggcca atgaagagca gacagatttg 
tttgtcttaa ccagccttcg tgtgtcaggt 
20 gagtttgacc aagattctct aggtaaaatt 
gaggagctgg accacactat tttctgtgtt 
gggacgctgg atgctgaaaa actcaaggct 
aaaatcaaaa atggccagga tcttgttttg 
tatatctcag cgccacgtcc aggtaagatt 
25 gatgccagtg tgcgtctggc tgtcaatgca 
aagggtgatg aaaaaattgc tcgtaaccat 
gtagcggtag aagcaggtgc caaacgcctc 
tcaaaagata ttagcaaact caagaaagac 
gtcaaagact tggaagaagt ggaaatctag 

30 

<210> 17 
<211> 1662 
<212> DNA 

<213> Streptococcus pneumoniae 

35 

<400> 17 

atgagtaata tcagtttaac aacacttggt 
attgctgaaa ttggagagtc catttttgtt 
gaacaattag gggtcgatgt ggtgattcca 

40 cgtattgctg gggttttctt gacccacggg 
ctcttggcag aggctaaagt tcctgtattt 
ctctttgtca aaggaaatga tgccgttaag 
aatacggaga ttgattttgg tgggacagtg 
ccagagagtc tgggaattgt cttgaagaca 

A5 ttcaaatttg accaaacggc tagtgaatct 
attggtcgtg acggcgtcct ggctctcctc 
caggtggcta gtgaaagtga agttagggat 
ggtcgtatca tcgttgcagc tgtttccagt 
gctgcggata aaacaggtcg acgtatcgtc 

50 cgcacagcga ttcgtcttaa gaagttgtct 
aaagatatgt ctcgctttga agaccatgag 
gaacctatca atggacttcg taagatgtcg 
gatggggacc tggtctatat tgctacggct 
cgtgtagaaa atatgattta tcaggcaggt 

55 catgtatcag ggcacggaaa tgtgcgtgat 
aagtacctct tccctgtcca aggggagtat 
atggcagttg ggatgttgcc agaacgcatc 



PCT7US00/35604 

gagccaacag ctggtctaga tcctctaggg 540 

ctccaccagt cagggatgac catcgtcttg 600 

tatgcgaatc aagtctatgt aatggaaaag 660 

gatgtctttc aagacgttgt ttttatggaa 720 

gccttttgta aacgattggc tgatagaggc 780 

gaggagttca aggagtcgct aaatggatag 840 



ggtcagccct ctaaagcccg caacgtttca 60 
aacgaagttt ggctctttga ctgtggagaa 120 
attcgaccac gtaaggtcag caaaatcttt 180 
ggtttgccag gtttcctttc tagccgtgcc 240 
gaaatctacg gacctcaagg aatcaagtca 300 
tctcgtctgc cctaccgcat tcatttccat 360 
cttgaaatcg ataaattcac tgtgtatgca 420 
ggctatcgtg tcatgcaaaa ggatctagaa 4§0 
gctggtgttc cgttcggccc gctttttggt 540 
gaagacggaa ctgaaatcaa ggcagcagac 600 
atcactattt taggagacac tcgaaaaacg 660 
gatgtcctag ttcatgagtc cacttatggc 720 
ggtcactcaa ctaatatgca agctgcacaa 780 
ctactcaacc atatcagtgc ccgtttcctc 840 
gctgccacaa tttttgaaaa tgtccatgtg 900 

930 



ggtgtgcgtg agaatggaaa aaatatgtac 60 
ttgaatgtag ggttaaaata tcctgaaaat 120 
aacatggatt acctttttga aaatagcgac 180 
catgcggatg cgattggtgc tctaccttat 240 
gggtctgagt tgaccattga gttggcaaag 300 
aaatttaatg atttccatgt cattgatgag 360 
gtttccttct tccctacgac ttactccgtt 4 20 
tcggaaggaa gcatcgttta tacaggtgac 4 80 
tatgcaactg attttgctcg tttggcagag 540 
agtgattcgg ccaatgcaga cagcaatatt 600 
gaaattaccc aaactattgc tgactgggaa 660 
aatctttctc gtattcagca gatttttgac 720 
ttgacaggat ttgatattga aaatatcgtc 780 
ttagccaacg aaattctctt gattaagcct 840 
ttgattattc ttgagacagg tcgtatgggt 900 
attggtcgcc atcgttatgt agaaatcaag 960 
ccgtctattg ctaaagaagc ctttgttgcg 1020 
ggggttgtca aattgattac* ccaaagttta 1080 
ttgcagctga tgatcaatct tttgcaacct li40 
cgtgagttgg atgctcacgc taaggctgcc 1200 
ttcattccta aaaaggggac gaccatggct 1260 



7 



WO 01/49721 



PCT/US00/35604 



tacgagaatg gagactttgt tccagctgga tcggtttcag caggagatat cttaattgat 1320 

gggaatgcca ttggtgatgt tggaaatgtt gttcttcgtg accgtaaggt cttgtcagag 1380 

gatggaattt tcatcg'tggc tattacagtc aaccgtcgtg agaagaaaat tgtggctaga 1440 

gctcgtgttc acacgcgtgg atttgtttat ctcaagaaga gtcgcgatat tctccgtgaa 1500 

5 agttcagaat tgattaacca aacggtagaa gattatcttc aaggagatga ctttgactgg 1560 

gcagatctta aagggaaggt tcgaga.taat ttgaccaagt atctctttga ccaaaccaag 1620 

cgtcgtccag ctattttacc agtagtcatg gaagcaaaat aa 1662 

<210> 18 
10 <211> 951 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 18 

15 atgacaaaag aatttcatca tgtaacggtc ttactccacg aaacgattga tatgcttgac 60 

gtaaaacctg acggtatcta cgttgatgcg actttgggtg gagcaggcca tagcgagtat 120 

ttattaagta aattaagtga aaaaggccat ctctatgcct ttgaccagga tcagaatgcc 180 

attgacaatg cgcaaaaacg cttggcacct tacattgaga agggagtggt gacctttatc 24 0 

aaggataact tccgtcattt acaggcacgt ttgcgcgaag ctggtgttca ggaaattgat 300 

20 ggaatttgtt atgacttggg agtgtctagt cctcaattgg accagcgtga gcgtggtttt 360 

tcttataaaa aggatgcgcc actggacatg cggatgaatc aggatgctag tctgacagcc 420 

tatgaagtgg ttaatcatta tgactatcat gatttggttc gtattttctt caaatacggt 480 

gaggataaat tctctaaaca gattgcgcgt aagattgagc aagcgcgtga agtgaagccg 54 0 

attgagacaa cgactgagtt agcagagatt atcaagttgg tcaaacctgc caaggaactc 600 

25 aagaagaagg gtcatcctgc taagcagatt ttccaggcta ttcgaattga agtcaatgat 660 

gaactggggg cggcagatga gtccatccag caggctatgg atatgttggc tctggatggt 720 

agaatttcag tgattacctt tcattcctta gaagaccgct tgaccaagca attgttcaag 780 

gaagcttcaa cagttgaagt tccaaaaggc ttgcctttca tcccagatga tctcaagccc 84 0 

aagatggaat tggtgtcccg taagccaatc ttgccaagtg cggaagagtt agaagccaat 900 

30 aaccgctcgc actcagccaa gttgcgcgtg gtcagaaaaa ttcacaagta a 951 

<210> 19 
<211> 999 
<212> DNA 
35 <213> Streptococcus pneumoniae 

<400> 19 

atgagtagaa ttttagataa tgagataatg ggggatgagg agttagtaga acgcacgctc 60 
cgtcctcagt atttacgtga atatatcgga caggataagg tcaaggacca gctacaaatc 120 

40 tttattgaag ctgccaaaat gcgggatgaa gcgctggatc atgtgctctt atttgggcct 180. 
ccaggtctcg ggaaaacgac catggccttt gttattgcca acgaactggg agtcaatctt 240 
aagcagacgt cgggtccagt cattgaaaaa gccggagatc tggtagctat tttgaatgag 300 
ttagagcctg gggatgtact ttttattgat gagatccatc gtttgccaat gtcagtggaa 360 
gaggtgcttt atagtgctat ggaggacttc tacatcgata ttatgattgg ggctggtgag 420 

45 ggtagtcgta gtgttcattt ggagttacca ccttttacct tgattggtgc gacgactcgg 4 80 
gctggtatgc tctccaatcc gctacgggca cgttttggga ttacaggcca tatggagtat 540 
tatgcccatg ctgacttgac agaaattgtc gagcggacgg cagatatttt tgagatggaa 600 
atcactcatg aggcagcatc tgagttggcc ctacgtagtc gtgggacccc tcgtattgcc 660 
aatcgtctcc tcaagcgcgt gcgcgatttt gcccagataa tggggaatgg ggtaattgat 720 

50 gatattatta ccgataaggc tttgactatg ctggatgttg accatgaagg tttggactat 780 
gtggatcaaa aaatccttcg taccatgatt gagatgtaca gtggaggacc tgttggtcta 840 
ggaactcttt ctgtgaatat cgccgaagag cgtgagacag ttgaagacat gtatgagcct 900 
tacttgattc aaaaaggttt tatcatgcgg acacggtctg gacgggtggc gactgctaag 960 
gcatatgagc acttaggtta tgaatacagt gaaaaataa 999 

55 

<210> 20 
<211> 1311 
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<212> DNA • 

<213> Streptococcus pneumoniae 
<400> 20 

5 atgagtatgt ttttagatae agctaagatt aaggtcaagg ctggtaatgg tggcgatggt 60 
atggttgcct ttcgtcgtga aaaatatgtc cctaatggag gcccttgggg tggtgatggt 120 
ggtcgtggag gcaatgtggt cttcgttgta gacgaaggac tacgtacctt gatggatttc 180 
cgctacaatc gccatttcaa ggctgattct ggtgaaaaag ggatgaccaa agggatgcat 240 
ggtcgtggtg ctgaggacct tagagttcga gtatcacaag gtacgactgt tcgtgatgcg 300 

10 gagactggca aggttttaac agatttgatt aaacatgggc aagaatttat cgttgcccac 360 
ggtggtcgtg gtggacgtgg aaatattcgt tttgcgacac caaaaaatcc tgcaccggaa 420 
atctctgaaa atggagaacc aggtcaggaa cgtgagttac aattggaact aaaaatcttg 480 
gcagatgtcg gtttagtagg attcccatct gtagggaagt caacactttt aagtgttatt 540 
acctcagcta agcctaaaat tggtgcctac cactttacca ctattgtacc aaatttaggt 600 

15 atggttcgca cccaatcagg tgaatccttt gcagtagccg acttgccagg tttgattgaa 660 
ggggctagtc aaggtgttgg tttgggaact cagttcctcc gtcacatcga gcgtacacgt 720 
gttatccttc acatcattga tatgtcagct agcgaaggcc gtgatccata tgaggattac 780 
ctagctatca ataaagagct ggagtcttac aatcttcgcc tcatggagcg tccacagatt 840 
attgtagcta ataagatgga catgcctgag agtcaggaaa atcttgaaga atttaagaaa 900 

20 aaattggctg aaaattatga tgaatttgaa gagttaccag ctatcttccc aatttctgga 960 
ttgaccaagc aaggtctggc aacactttta gatgctacag ctgaattgtt agacaagaca 1020 
ccagaatttt tgctctacga cgagtccgat atggaagaag aagcttacta tggatttgac 1080 
gaagaagaaa aagcctttga aattagtcgt gatgacgatg cgacatgggt actttctggt 1140 
gaaaaactca tgaaactctt taatatgacc aactttgatc gtgatgaatc tgtcatgaaa 1200 

25 tttgcccgtc agcttcgtgg tatgggggtt gatgaagccc ttcgtgcgcg tggagctaaa 1260 
gatggggatt tggtccgcat tggtaaattt gagtttgaat ttgtagacta g 1311 

<210> 21 
<211> 519 
. 30 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 21 

atgaactact ttaatgttgg gaaaatcgtt aatacgcagg gattacaggg tgagatgcga 60 

35 ■ gtcttgtctg tgacggattt tgcagaagaa cggtttaaaa aaggagctga gctggctttg 120 

tttgatgaaa aagatcagtt tgtccaaaca gtgaccatcg ctagccaccg taaacagaag 180 

aactttgaca ttattaaatt caaagatatg taccatatca atactatcga aaagtacaag 240 

ggatacagtc tcaaggtcgc tgaggaagat ttgaatgacc tagacgatgg tgaattttac 300 

tatcacgaga ttatcggttt ggaagtctat gagggtgata gcttggttgg aaccatcaag 360 

40 gaaatcctgc aaccaggtgc taatgatgtc tgggtggtca aacgaaaagg caaacgtgat 4 20 

ttgcttttac cttatatccc accagtggtt ctcaatgttg atattccaaa taaacgggtc 4 80 

gatgtggaaa tcttagaagg gttagacgat gaagattga 519 

<210> 22 
45 <211> 720 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 22 

50 atgaagattg atattttaac cctctttcca gagatgtttt ctccactgga gcactcaatc 60 

gttggaaagg ctcgagaaaa agggctcttg gatatccagt atcataattt tcgagaaaat 120. 

gctgaaaagg cccgtcatgt agatgatgag ccctacggag gcggtcaggg catgttgctc 180 

agagcacaac ctattttcga ttcctttgat gctattgaaa agaaaaatcc gcgcgttatt 240 

ctcctcgatc ctgctggaaa gcagtttgat caggcttatg ctgaagattt ggctcaagag 300 • 

55 gaagagctaa tctttatctg tgggcactat gagggttatg atgagcgcat taagaccttg 360 

gtaacagatg agatttccct aggcgactat gtcctcactg gtggagaatt ggcagctatg 4 20 

accatgattg atgctacagt tcgcctgatt ccagaagtga ttggcaagga gtctag'ccac 480 



9 



WO 01/49721 



PCT/US00/35604 



10 



15 



20 



caagatgata gtttttcttc aggtctttta 

tatcgaggca tggtcgtgcc agatgtattg 

tggcgattgt acgagagttt aaagaaaacc 

tatcaactga cagtagaaga agaaaaaatg 

<210> 23 
<211> 561 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 23 

atgattgaag caagtaaatt aaaagctggt 
attcgcgttt tggaagctag tcaccacaaa 
aaattgcgtg atgtccgtac tggttctaca 
tttgaacaag ctattatcga gactgtccca 
gcatacttca tgaatacaga aacttacgac 
aacgaattgc tttacatcct tgaaaactct 
gtgatcggtg tcaccgttcc tactactgtt 
atcaaaggtg ctactgttac aggttctggt 
gtaaacgttc cagacttcat cgaagcagga 
acttacgttt ctcgtgccta a 



gaatatcatc agtacacacg tccctatgat 54 0 

atgagtggcc atcatgaaaa gattcgtcag 600 

tacgagcgca gaccggattt acttgaacat 660 

ctggcagaaa tcaaagaaaa caaagaataa 720 



atgacctttg 
ccaggtaaag 
tttgacacaa 
gctcaatact 
cagtacgaaa 
gatgtgaaaa 
gagttgacag 
aaaccagcaa 
caaaaactcg 



aaacagctga 
gaaacacgat 
gctaccgtcc 
tgtacaaaat 
tccctgtagt 
tccaattcta 
ttgctgaaac 
cgatggaaac 
ttatcaacac 



cggcaaattg 60 
catgcgtatg 120 
agaggaaaaa 180 
ggatgacaca 240 
caatgttgaa 300 
cggaactgaa 360 
tcaaccatct 420 
tggacttgtc 480 
tgcagaagga 54 0 
561 



<210> 24 
<211> 1572 
25 <212> DNA 

<213> Streptococcus pneumoniae 



<400> 24 
atggcatttg 

30 ggaaaaatct 
gaggccgacg 
■ gtcgggcatg 
gaggaactga 
ccaaccatca 

35 ttggccaaca 
tatcgtccag 
tttgcacttg 
caaactaatc 
ctcctcatga 

40 gtcgttgatg 
ttggaagtga 
ctgtctgttc 
acggacattg 
.atgctcactt 

45 gctgagaaga 
gtgcaaaata 
ccagcccttc 
tcttcgatga 
cgtattgctg 

50 aaccaggcta 
caaatgggga 
atgtctgccc 
ggagcaggaa 
gaatttgcta 

55 aaacgcaagt 



aaagtttaac 
ctgaatctga 
ttgccttgcc 
aggtcattga 
cagccgtttt 
tcatgatggt 
aactcaagaa 
ctgccattga 
gaacagaagt 
ataacgacta 
atgagcttcg 
ctatgattgg 
ctggggtcat 
gtcacattac 
aaaccttcca 
tgattgagaa 
tgcgcgaaaa 
tggggccgat 
aaaacatgaa 
cacctgaaga 
ctggttctgg 
aacagctcat 
ttaatccaaa 
ttgaaggaat 
tgccagatat 
tgaaacagtc 
aa 



agaacgtttg 
tgtccaagag 
tgttgtaaag 
tacacttaat 
aggttctgat 
tggtttacaa 
agaagaaaat 
ccagcttaag 
accagctgtt 
tgtcttgatt 
tgatgtgaaa 
tcaggaagca 
ccttaccaag 
tggaaaacca 
cccagaccgc 
agcttctcag 
cacctttgat 
ggaagacttg 
ggtggatgaa 
gcgtgaaaac 
aaatacattc 
gcagggtgtt 
taaccttcct 
gatgggacaa 
gagccagatg 
catgaaacgt 



cagaacgtct ttaaaaatct 



gcaaccaaag 
gactttatca 
cctgcgcaac 
acggcagaaa 
g.gggctggta 
gctcgtcctt 
accttgggac 
gagattgtac 
gatactgcgg 
acattggctc 
gccaatgttg 
attgatggcg 
atcaagttca 
atgtctagcc 
gaatacgatg 
tttaatgatt 
ctcaagatga 
cgccagattg 
ccagatttgt 
gtcgaagtca 
atgtctgggg 
aaaaatatgc 
ggcggtatgc 
tttggtggcg 
atggctaaca 



aaattcgctt 
agaaagttcg 
agattattaa 
ttatcaagtc 
aaacaacctt 
tgatgattgc 
aacagattga 
gtcaaggttt 
gtcgtttgca 
aaccaaatga 
cgcgtgagtt 
atactcgtgg 
ctggtacagg 
gtatccttgg 
aacaaaaagc 
tcatcgatca 
ttccaggtat 
ctcgtaaacg 
taaatccaag 
ataaattcat 
atatgaataa 
caaatatggg 
ctgacttatc 
gtttgaaagg 
aaatgaagaa 



acgtaaaaaa 
ggccttgct'c 
tgagcgtgca 
aatcgttgat 
acctaagatt 
tgctggtaaa 
ggcggatatt 
tgtgcctgtc 
ggagcaagcc 
gattgatgag 
aatcttgctt 
taatgctcag 
tggtgctgct 
tgaaaagatt 
tatgggggat 
ccttgaaatg 
attagatcag 
ggcaaacaat 
tgccattgtg 
ccgtcgccgt 
caaggacttt 
aatgatgaag 
aggaatggat 
agctctcgga 
taaaattggt 
agcgaagaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320- 

1380 

1440 

1500 

1560 

1572 



<210> 25 
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<211> 846 
<212> DNA 

<213> Streptococcus pneumoniae 
5 <400> 25 

atgtatctta ttgaaatttt aaaatctatc ttcttcggga ttgttgaagg aattacggaa 60 

tggttgccga tttccagtac aggtcatttg attttagcag aggagtttat ccaataccaa 120 

aatcaaaatg aagcctttat gtccatgttt aatgtcgtga ttcagcttgg tgctatttta 180 

gcagttatgg tgatttattt taacaagctc aatcctttta aaccgactaa ggacaaacag 240 

10 gaagttcgta agacttggag actatggttg aaggtcttga ttgctacttt acctttactt 300 

ggtgtcttta aatttgatga ttggtttgat acccacttcc ataacatggt ttcagttgct 360 

ctcatgttga ttatctacgg ggttgccttc atctatttgg aaaagcgcaa taaagcgcgt 420 

gctatcgagc caagtgtaac agagttggac aagcttcctt atacgaccgc tttctatatc 480 

ggactcttcc aagttcttgc tcttttacca gggactagcc gttcaggtgc aacgattgtc 54 0 

15 ggtggtttgt taaatggaac cagtcgttca gttgtgacag aatttacctt ctatcttggg 600 

attcccgtta tgtttggagc tagtgcctta aagattttca aatttgtgaa agccggagaa 660 

ctcttgagct ttgggcaatt gtttttgctc ttggtcgcga tgggagtagc ttttgcggtc 720 

agcatggtgg ctattcgctt cttgaccagc tatgtgaaaa aacacgactt cacccttttt 780 

ggtaaatacc gtatcgtgct tggtagtgtt ttgctacttt acagttttgt ccgtttattt 840 

20 gtataa 84 6 

<210> 26 

<211> 1290 

<212> DNA 

25 <213> Streptococcus pneumoniae 

<400> 26 

atgggattat ttgaccgtct attcggaaaa aaagaagaac ctaaaatcga agaagttgta 60 
aaagaagctc tggaaaatct tgatttgtct- gaagatattg agcctgcctt cacagaagct 120 

30 gaggaagttt ctcaagaaga agcagaggtt gaaagttctg aagaatctgt gttccaagaa 180 
gaggatagtc aagacacagt cgaagaaaat ctggatttag agccagttgt agaggtttct 240 
caagaagaag tagaagaatt tccaaactca caagaagtca cagaggaaga gaagcttgag 300 
cacgaaggaa ctgtagaaga aaataatttt gaagtgcttg aaccagaagc tcctcaaaca 360 
gaagaaactg ttcaggaaaa atatgaccgc agtcttaaga aaactcgcac aggtttcggt 420 

35 gcccgcttga atgccttctt tgctaacttc cgctctgttg acgaagaatt tttcgaggaa 4 80 
ctggaagaac tgttgattat gagtgatgtt ggtgtccaag tcgcttctaa cttaacggag 540 
gaactacgtt acgaagccaa gcttgaaaat gccaagaaac ctgatgcact tcgtcgtgtc 600 
atcattgaga aattggttga gctttatgaa aaggatggta gctacgatga aagcatccac 660 
ttccaagata acttgacagt tatgctcttt gttggtgtga atggtgttgg gaaaacaact 720 

40 tctatcggaa aactag.ccca ccgctacaaa cgagctggta agaaggtcat gctggttgca 780 
gcagatacct tccgtgcggg tgcagtagct cagctagctg aatggggccg acgagtagat 840 
gttccagtag taactggacc tgaaaaagct gatccagcca gcgtggtctt tgatggtatg 900 
gaacgtgccg tggctgaagg tatcgatatt ctcatgattg atactgctgg tcgtctgcaa 960 
aataaggata accttatggc tgagttggaa aagattggtc gtattatcaa acgtgttgtg 1020 

45 ccagaagcac cacatgaaac. cttcttggca cttgatgcat caacaggtca aaatgcccta 1080 
gtacaggcca aagaattttc gaaaatcaca cctttaacgg gaattgtttt gactaagatt 114 0 
gatggaactg ctcgaggagg tgtggttcta gccattcgtg aagaactcaa tattcctgta 1200 
aaattgattg gttttggtga. aaaaatcgat gatattggag agtttaactc agaaaacttc 1260 
atgaaaggtc tcttggaagg tttaatctaa 1290 

50 

<210> 27 
<211> 498 
<212> DNA 

<213> Streptococcus pneumoniae 

55 

<400> 27 

atgtatattg aaatggtaga tgaaactggt caagtttcaa aagaaatgtt gcaacaaacc 60 
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caagaaattt tggaatttgc agccaaaaaa ttaggaaaag aagacaagga gatggcagtc 120 

acttttgtga ccaatgagcg tagtcatgaa cttaatctgg agtaccgtga caccgaccgt 180 

ccgacagatg tcatcagcct tgagtataaa ccagaattgg aaattgcctt tgacgaagag 240 

gatttgcttg aaaatccaga attggcagag atgatgtctg agtttgatgc ctatattggg 300 

5 gaattgttca tctctatcga taaggctcat gagcaggccg aagaatatgg tcacagcttt 360 

gagcgtgaga tgggcttctt ggcagtacac ggctttttac atattaacgg ctatgatcac 420 

tatactccgg aagaagaagc ggagatgttc ggtttacaag aagaaatttt gacagcctat 480 

ggactcacaa gacaataa 4 98 

10 <210> 28 
<211> 768 
<212> DNA 

<213> Streptococcus pneumoniae 
15 <400> 28 

atgagtattc gagtaattat tgccggtttt aagggaaaga tgggccaggc tgcttgtcag 60 

atggtattga ctgatccaga cttggacttg gtggcagttt tggatccttt tgagtctgag 120 

tcagaatggc agggtattcc tgttttcaag gataaggctg atttagctgg ttttgaagcg 180 

gatgtctggg tagattttac tactccagct gttgcctacg aaaatacacg ttttgctctt 240 

20 gaaaatggct ttgctccagt agttggaacg actggtttca cgagtgaaga aattgcagag 300 

ctaaaagaat tttctcgtgc ccaagacttg ggtggcctga ttgcccctaa ctttgccttg 360 

ggtgctgtct tactcatgca atttgcgacg caggctgcca aatatttccc aaatgtggag 420 

attattgagc tccatcatga caagaaaaag gatgctccga gtggaacagc cattaaaaca 480 

gctgagttga tggcagaggt tcgagagtca attcagcaag gtgcagcaga tgaggaagag 540 

25 ctgattgctg gtgctcgtgg tgctgacttt gatggtatgc gcatccactc agttcgtttg 600 

ccaggcttgg tagctcatca ggaagtcatc tttggcaatc agggagaagg gttgaccctc 660 

cgtcatgact cctatgatcg catctccttc atgacaggag tcaatttggg aattaaagaa 720 

gttgtcaagc gtcatgagct tgtctatgga ttagaacact tattatga 768 

30 <210> 29 
<211> 276 
<212> DNA 

<213> Streptococcus pneumoniae 
35 <400> 29 

atggcaaaca aacaagattt gatcgctaaa gtagcagaag ctacagaatt aactaagaaa 60 
gactcagcag cagcagttga agctgtattt gcagcagtag ctgactatct tgcagctggt 120 
gaaaaagttc agttgatcgg ttttagtaac tttgaagttc gtgagcgcgc agaacgtaaa 180 
ggtcgcaacc cacaaactgg taaagaaatg acaattgcag cttctaaagt accagcattc 240 
40 aaagctggta aagctcttaa agacgctgtt aaataa 276 

<210> 30 
<211> 921 
<212> DNA 
45 <213> Streptococcus pneumoniae 

<400> 30 

atgactaaaa cagccttttt atttgctggt caaggtgccc agtatctagg gatgggacgg 60 

gatttctatg atcagtatcc gattgttaaa gaaacgattg atcgagcgag tcaggtgctc 120 

50 ggttatgatt tgcgttatct catcgatacg gaagaggaca aactcaatca gacccgctat 180 

acgcaaccag ccattctagc gacttcggtt gctatctacc gtttattgca agaaaagggc 240 

tatcagcctg atatggtcgc tggtttgtct cttggagaat actctgcctt ggtggcaagc 300 

ggcgccttgg attttgaaga tgcggttgcc ttggtagcta agcgtggagc ctatatggaa 360 

' gaagcggctc. ctgctgactc tggcaagatg gtagcagttc tcaatacgcc agtagaggtc 420 

55 attgaagaag cctgtcaaaa agcttctgaa cttggagtgg ttactccagc caactataac 480 

acacctgcac aaatcgtcat tgctggagaa gtggttgcag ttgatcgagc ggttgaactt 540 

ttgcaagaag caggtgccaa acgcttgatt cctcttaagg tgtcaggtcc ctttcacacc 600 



12 



WO 01/49721 



PCT/US00/35604 



gctctccttg agccagctag ccagaaacta gctgaaactc tagctcaggt aagtttttca 660 
gattttactt gtcccctagt cggcaataca gaagctgctg tgatgcaaaa agaggacatt 720 
gctcagctct tgacgcgtca ggtcaaggaa cccgttcgtt tctatgaaag tattggggtc 780 
atgcaagaag caggcataag caactttatc gagattggac cggggaaagt cttgtcaggt 840 
5 tttgttaaaa aaattgatca aactgctcac ttagctcatg tggaagatca agcgagttta 900 
gtagcacttt tagaaaaata g 921 

<210> 31 

<211> 732 

10 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 31 

atgaaactag aacataaaaa tatctttatt acaggttcga gtcgtggaat tggtcttgcc 60 
15 atcgcccaca agtttgctca agcaggagcc aacattgtct taaacagtcg tggggcaatc 120 
tcagaagaat tgctcgctga gttttcaaac tatggtatca aggtggttcc catttcagga 180 
gatgtatcag attttgcaga cgctaagcgt atgattgatc aagctattgc agaactgggt 240 
tcagtagatg ttttggtcaa caatgcaggg attacccaag atactcttat gctcaagatg 300- 
acagaagcag attttgaaaa agtgctcaag gtcaatctga ctggtgcctt taatatgaca 360 
20 caatcagtct tgaaaccgat gatgaaagcc agagaaggtg ctatcattaa tatgtctagt 420 
gttgttggtt tgatggggaa tattggtcaa gctaactatg ctgcttctaa ggctggcttg 4 80 
attggcttta ccaagtctgt ggcacgcgag gtcgctagtc ggaatatacg agtcaatgtg 54 0 . 
attgctccag gaatgattga gtctgatatg accgctatct tatcagataa gattaaggaa 600 
gctacactag ctcagattcc gatgaaagaa tttgggcagg cagagcaggt tgcagatttg 660 
25 acagtatttt tagcaggcca agattatcta actggtcaag* tggttgccat tgatggtggc 720 
ttaagtatgt ag 732 

<210> 32 
<211> 831 
30 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 32 

atgggagtga aaaagaaact aaagttgact agtttgctag gactgtctct gttaatcatg 60 

35 acagcctgtg cgactaatgg ggtaactagc gatattacag ccgaatcggc tgatttttgg 120 

agtaaattgg tttacttctt tgcggaaatc attcgctttt tatcgtttga . tattagtatc 180 

ggagtgggga ttattctctt tacggtcttg- attcgtacag tcctcttgcc agtctttcag 240 

gtgcaaatgg tggcttctag gaaaatgcag gaagctcagc cacgcattaa ggcgcttcga 300 

gaacaatatc caggtcgaga tatggaaagc agaaccaaac tagagcagga aatgcgtaaa 360 

40 gtatttaaag aaatgggtgt cagacagtca gactctcttt ggccgatttt gattcagatg 420 

ccggttattt tggccctgtt ccaagcccta tcaagagttg actttttaaa gacaggtcat 4 80 

ttcttatgga ttaaccttgg iSagtgtggat acaacccttg ttcttccgat tttagcagca 540 

gtattcacct ttttaagtac ttggttgtcc aacaaagctt tgtctgagcg aaatggcgct 600 

acgactgcga tgatgtatgg gattccagtc ttgattttta tctttgcagt ttatgcgcca 660 

45 ggtggagtcg ccctatactg gacagtgtct aatgcttatc aagtcttgca aacctatttc 720* 

ttgaataatc cattcaagat tatcgcagag cgcgaggccg tagtacaggc acaaaaagat 780 

ttggaaaata gaaaaagaaa agccaagaaa aaggctcaga aaacgaaata a 831 

<210> 33 
50 <211> 1230 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 33 

55 atgaagatta gtaagaggca cttattaaat tattccatct tgattcccta cttactttta 60 
tctattttgg gcttgattgt ggtctattcg accaccagtg ctattttaat tgaagaaggc 120 
aagagcgcct tgcagttggt tcgaaaccaa ggaatctttt ggattgttag tttgatactg 180 
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attgccttaa tttataaatt gagactagat tttttgagaa atgagcgact aatcatttta 240 
gttatattaa tagaaatgct tttattgttc ttggctcgtt ttattggtat ttcagtaaac 300 
ggggcatacg gttggatttc ggttgcagga gtaactattc agccagctga gtacttaaaa 360 
atcattatta tttggtattt agctcaccga ttctccaaac agcaagaaga aatagctact 420 
5 tatgattttc aagttttgac tcaaaatcaa tggcttcccc gtgcttttaa tgattggcga 480 
ttcgttctcc tagttctgat tggaagtttg ggaattttcc ctgatttagg aaatgcgact 540 
attttagtct tggtttcctt gattatgtat acagttagtg gaatcgctta tcgctggttt 600 
tcaaccattc tggcgctcgt atctgccact tctgtctttg tcttgaccac tatcagccta 660 
atcggtgttg agaccttttc aaaaattcca gtatttggct atgtagccaa gcgctttagt 720 

10 gcctttttta atccttttgc cgatcgtgct gatgcaggtc accagttagc taattcttat 780 
tttgccatgg tcaatggtgg ttggtttggt ctaggtcttg gaaactcgat tgaaaaacga 84 0 
ggttatttgc cagaagctca tacagacttt gtcttttcta tcgtgattga agaatttggc 900 
tttgttggtg ccagtcttat tttagctctc ttgtttttca tgattttgcg gattatcttg 960 
gtcggtatcc gagcggagaa tcctttcaat gccatggttg cactcggtgt cggagggatg 1020 

15 atgttggttc aggtatttgt caatatcgga gggatttcgg gcttgattcc atctacagga 1080. 
gtgactttcc ctttcttatc ccagggtgga aatagtcttc tagtcttatc agtggcagta 1140 
gcctttgtct taaatattga tgccagtgaa aaacgcgcta agttgtaccg agaattggaa 1200 
aatcaaccaa tgaaccttct gttgaagtag 1230 

20 <210> 34 

<211> 1260 
<212> DNA 

<213> Streptococcus pneumoniae 
25 <400> 34 

atgctcggaa ttttaacctt tattctggtt tttgggatta ttgtagtggt gcacgagttc 60 
gggcacttct actttgccaa gaaatcaggg attttagtac gtgaatttgc catcggtatg 120 
ggacctaaaa tctttgctca cattggcaag gatggaacgg" cctataccat tcgaatcttg 180 
cctctgggtg gctatgtccg catggccggt tggggtgatg atacaactga aatcaagaca 240 

30 ggaacgcctg ttagtttgac acttgctgat gatggtaagg ttaaacgcat caatctctca 300 
gcjtaaaaaat tggatcaaac agccctccct atgcaggtga cccagtttga ttttgaagac 360 
aagctcttta tcaaaggatt ggttctggaa gaagaaaaaa catttgcagt ggatcacgat 420 
gcaacggttg tggaagcaga tggtactgag gttcggattg cacctttaga tgttcaatat 480 
caaaatgcga ctatctgggg caaactgatt accaattttg caggtcctat gaacaatttt 540 

35 atcttaggtg tcgttgtttt ttgggtttta atctttatgc agggtggtgt cagagatgtt 600 
gataccaatc agttccatat catgccccaa ggtgccttgg ccaaggtagg agtaccagaa 660 
acggcacaaa ttaccaagat cggctcacat gaggttagca actgggaaag cttgatccaa 720 
gctgtggaaa cagaaaccaa agataagacg gcaccgactt tggatgtgac tatttctgaa 780 
aaggggagtg acaaacaagt cactgttaca cccgaagata gtcaaggtcg ttaccttcta 8 40 

40 ggtgttcaac cgggggttaa gtcagatttt ctatccatgt ttgtaggtgg ttttacaact 900 
gctgctgact cagctctccg aattctctca gctctgaaaa atctgatttt ccaaccggat 960 
ttgaacaagt tgggtggacc tgttgctatc tttaaggcaa gtagtgatgc tgctaaaaat 1020 
ggaattgaga atatcttgta cttcttggca atgatttcca tcaatattgg gatttttaat 1080 
cttattccga ttccagcctt ggatggtggt aagattgtgc tcaatatcct agaagccatc 1140 
• 45 cgccgcaaac cattgaaaca agaaattgaa acctatgtca ccttggccgg agtggtcatc 1200 
atggttgtct tgatgattgc tgtgacttgg aatgacatta tgcgactctt ttttagataa 1260 

<210> 35 

<211> 594 

50 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 35 

atgtacgcat atttaaaagg aatcattacc aaaattactg ccaaatacat tgttcttgaa 60 
55 accaatggta ttggttatat cctgcatgtg gccaatcctt atgcctattc aggtcaggtt 120 
aatcaggagg ctcagattta tgtgcatcag gttgtgcgtg aggacgccca tttgctttat 180 
ggatttcgct cagaggatga gaaaaagctc tttcttagtc t'aatttcggt ctctgggatt 240 
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ggtcctgtat,,cagctcttgc tattatcgct gctgatgaca atgctggctt ggttcaagcc 300 

attgaaacca agaacatcac ctacttgacc aagttcccta aaattggcaa gaaaacagcc 360 

cagcagatgg tgctggactt ggaaggcaag gtagtagttg caggagatga ccttcctgcc 420 

aaggtcgcag tgcaagcaag tgctgaaaac caagaattgg aagaagctat ggaagccatg 480 

5 ttggctctgg gctacaaggc aacagagctc aagaaaatca agaaattctt tgaaggaacg 540 

acagatacag ctgagaacta tatcaagtcg gcccttaaaa tgttggtcaa atag 594 

<210> 36 

<211> 774 

10 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 56 

atgaagaata atcgtatttt agcactttct ggaaatgata tttttagtgg tggtggactg 60 
15 tcagctgatt tggctaccta taccttgaac ggcttgcatg ggtttgtagc agtgacttgt 120 
ttgacagcct tgacagaaaa aggatttgaa gtctttccaa ctgatgatac catttttcaa 180 
catgaattag atagcttgcg tgatgtggaa tttgggggaa ttaagattgg tcttctccct 240 
actgtcagtg tggctgagaa ggccttggac tttatcaaac aacgcccagg agtacctgtg 300 
gtgttggatc ctgtcttggt ctgcaaggaa acgcatgatg tag.ctgtcag tgagctctgc 360 
20 caagagttga ttcgcttctt cccttatgtc agtgtgatta cgcctaatct cccagaagca 420 
gaattattat ccggtcagga aattaaaacc ttggaagaca tgaaaactgc agcgcagaaa 480 
ttgcatgatt taggagcgcc agcagtcatt atcaagggag gcaatcgtct tagtcaggac 54 0 
aaggctgtgg atgtctttta tgatggacag acctttacta tcctagaaaa tccagttatc 600 
caaggccaaa atgctggtgc aggttgtacc tttgcctcta gcattgccag tcacctggtt 660 
25 . aaaggtgata aatttttgcc agcagtagaa agctctaagg ctttcgttta tcgtgctatt 720 
gcacaagcag atcagtatgg agtaagacaa tatgaagcaa acaaaaacaa ctaa 77 4 

<210> 37 
<211> 1239* 
30 <212> DNA 

<213> Streptococcus pneumoniae *" 

<400> 37 

atgattgaaa cggagaaaaa agaggagcga gtcctgctga ttggtgtgga attgcagggt 60 

35 atggacagtt ttgacctctc catggaagaa ttggctagtt tagcgaaaac ggcaggggca 120 

gtcgttgtag atagctacag acaaaaacgt gaaaaatatg attccaagac .cttcgtcggc 180 

tctggtaagt tggaagagat tgcgcttatg gtggatgcag aagaaatcac tactgtcatc 24 0 

gtcaacaatc gtctgacccc aaggcagaat gtcaatctag aggaagttct cggtgttaag 300 

gtcattgacc gtatgcagtt. gattttggat atctttgcca tgcgggctcg aagccatgaa 360. 

40 gggaagctcc aagtccacct agcccaattc aaatacctct tgcctcgctt ggttggtcag 420 

gggattatgc tcagccgtca ggcaggggga attggttccc gtggtcctgg tgaaagccaa 480 

ctggagctga accgtcgtag cgttcgcaat caaatcacgg atatcgagcg ccagcttaag 540 

gtggttgaga aaaatcgtgc gactgtcaga gaaaaacgtt tggagtctag cacttttaag 600 

attggtttga ttggttatac taatgctggg aaatcaacta tcatgaacat cttgaccagt 660 

45 aagacccagt atgaagcaga tgagctcttt gcgactctgg atgcgacaac caagagtatt 720 

catctgggag gcaatctcca agtaactttg acagataccg ttggctttat ccaagatttg 780 

ccgacagagt tggtgtccag tttcaagtca accttggaag aaagcaagca tgtggacctt 840 

ctggttcatg ttatcgatgc tagcaatcct taccacgagg- agcatgaaaa aacggttctc 900 

tccatcatga aagacctgga catggaagat attcctcact tgacgcttta taataaagcg 960 

. 50 gatttggtgg aggatttcac gcctacccaa* acgccatata ccctcatttc tgccaagtct 1020 

gaggacagtc gtgaaaactt gcaagcatta ttgctagata agattaagga aatttttgaa 1080 

gcatttaccc tgcgagtgcc tttttcaaag tcctacaaga ttcatgattt agagagtgtt 1140 

gcaattctgg aagaacgtga ttatca'ggaa gacggcgaag tgattacagg .ctacatttcc 1200 

gagaaaaata aatggaggtt agaagaattt tatgactga 1239 

55 

<210> 38 
<211> 483 
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<212> DNA 

<213> Streptococcus pneumoniae 



<400> 38 

5 atggcagaaa aaacatatcc tatgaccctt 
gaagaattga aattggttcg tcgaccagaa 
tacggtgacc tttcagaaaa cagtgagtac 
gaaggacaaa tctctagctt agaaacaaaa 
gcagttgccc aggacgaagt agcgattggt 
10 gacgaagaag aagtttatat tatcgtaggt 
gtttcaaatg aaagcccaat tgggcaggcc 
accattgaaa cgcctgttgg tagctatgat 
taa 

15 <210> 39 
<211> 570 
<212> DNA 

<213> Streptococcus pneumoniae 

20 <400> 39 

atgaccaaat tacttgtagg cttgggaaat 

aatgttggtt ttatgttgat tgatcaacta 

gataagatat ttcaagctga cctagcatcc 

gttaaaccaa cgacctttat gaatgaaagt 

25 tatggtttgg atattgacga tttacttatc 

aaaattcgtt taagagcaaa aggctcagca 

caacatatag gaactcaggt ctttaaccgt 

ggtatgtcag ttgttcatca tgttttgagt 

ttacagtctg ttgacaaagt tgacgattct 

30 gagaaaacaa tgcagaggta taacggataa 

<210> 40 
<211> 852 
<212> DNA 
35 <213> Streptococcus pneumoniae 

<400> 40 

atgattttaa ttacaggggc aaatggccaa 
gaacgtaatg aagaatacgt ggcagtagat 

40 atggttgaga aagtttttga agaggtgaaa 
accgctgttg atgcagcaga ggatgaagga 
gggacaaaaa atgtcgcaaa agcatctgaa 
acggactatg tctttgacgg taagaaacca 
ccagatccac agacagaata tggacgcact 

45 catgtgtcta atttctatat tatccgtact 
ttcgttttta ccatgcaaaa tcttgcgaaa 
cagtacggtc gtccgacttg gactcgtacc 
aatcgtaagg aatttggtta ttatcattt'g 
tatgattttg cagttgaaat tttgaaagat 

50 agtcaatttc cagccaaagc taaacgtccg 
aaagctactg gatttgttat tccaacttgg 
gaagtgagat aa 

<210> 41 
55 <211> 1224 
<212> DNA 

<213> Streptococcus pneumoniae 



gaggaaaagg aaaaacttga aaaagaatta 60 
gtggtagaac gcattaagat tgcccgttca 120 
gaagcagcta' aggatgaaca agcctttgtc 180 
atccgctatg ctgaaatcgt caatagcgac 240 
aaaacagtca ccatccaaga aattggtgag 300 
tcagctggtg cggatgcctt tgcaggtaag 360 
ttgattggca agaaaacagg tgatacagca 420 
gtaaaaatct tgaaggttga aaaaacagcc 480 

483 



ccaggggata aatattttga aacaaaacac 60 
gcgaagaaac agaatgtcac ttttacacac 120 
tttttcctaa atggagaaaa aatttatctg 180 
ggaaaagcag ttcatgcttt attaacttac 240 
atttacgatg atcttgacat ggaagttggg 300 
ggtggtcata atggtatcaa gtctattatt 360 
gttaagattg gaattggaag acctaaaaat 420 
aagtttgaca gggatgagta tatcggtatt 480 
gtaaactact atttacaaga gaaaaatttt 540 

570 



ttaggaacgg aacttcgcta tttattggat 60 
gtggctaaga tggacattac caatgaagaa 120 
ccgactttag tctaccattg tgcagcctac 180 
aaagagttgg acttcgccat caatgtgacg 240 
aagcatggtg caactctagt ttatatttct 300 
gttggacaag agtgggaagt tgatgaccga 360 
aagcgtatgg gggaagagtt agttgagaag 420 
gcctgggtat ttggaaatta tggcaaaaac 480 
actcataaga ctttaacagt tgtaaatgat 540 
ttggctgagt tcatgaccta cctagctgaa 600 
tcaaatgatg cgacagaaga cacaa'catgg 660 
acagatgtcg aagtcaagcc agtagattcc 720 
ctaaactcaa cgatgagcct ggccaaagcc 780 
caagatgcat tgcaagaatt ttacaaacaa 840 

852 
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<400> 41 

atgaaacgtt ctctcgactc aagagtcgat tacagtttgc tcttgccagt attttttcta 60 

ctggtcatcg gtgtggtggc tatctatata gccgttagtc atgattaccc caataatatt 120 

5 ctgcccattt tagggcagca ggtcgcctgg attgccttgg ggcttgtgat tggttttgtg 180 

gtcatgctct ttaatacaga atttctttgg aaggtgaccc cctttctata tattttaggc 240 

ttgggactta tgatcttgcc gattgtattt tataatccaa gcttagttgc atcaacgggt 300 

gccaaaaact gggtatcaat aaatggaatt accctattcc aaccgtcaga atttatgaag 360 

atatcctata tcctcatgtt ggctcgtgtc attgtccaat ttacaaagaa acataaggaa 420 

10 tggagacgca cggttccgct ggactttttg ttaattttct ggatgattct ctttaccatt 480 

ccagtcctag ttcttttagc acttcaaagt gacttgggga cggctttggt ttttgtagcc 54 0 

* attttctcag gaatcgtttt attatcaggg gtttcttgga aaattattat cccagtattt 600 

gtgactgctg taacaggagt * tgctggtttc ttagctatct ttattagcaa ggacggacga 660 

gcttttcttc accagattgg aatgccgacc taccaaatca atcggatttt ggcttggctc '720 

15 aatccctttg agtttgccca aacaacgact taccagcagg ctcaagggca gattgccatt 780 

gggagtggtg gcttatttgg tcagggattt aatgcttcga atctgcttat cccagttcga 84 0 

gagtcagata tgatttttac ggttattgca gaagattttg gctttattgg ctctgtcctg 900 

gttattgccc tctatctcat gttgatttac cgtatgttga agattactct taaatcaaat 960 

aaccagttct acacttatat ttccacaggt ttgattatga tgttgctctt ccacatcttt 1020 

20 gagaatatcg gtgctgtgac tggactactt cctttgacgg ggattccctt gcctttcatt 1080 

tcgcaagggg gatcagctat tatcagtaat ctgattggtg ttggtttgct tttatcgatg 114 0 

agttaccaga ctaatctagc tgaagaaaag agcggaaaag ttcc'attcaa acggaaaaag 1200 

gttgtattaa aacaaattaa ataa 1224 

25 <210> 42 
<211> 609 
<212> DNA 

<213> Streptococcus pneumoniae 
30 <400> 42 

atgggaaaaa tcatcggaat cactggggga attgcctcag gtaagtcaac tgtgacaaat 60 
tttctaaaac accaagggct ttcaagcagt ggattgccga cgcagtgttc caccaactac 120 
agaaaacctg gtggtcgtct gtttgaggct ttagtacagc actttgggca agaaatcatt 180 
cttgaaaacg gagaactcaa tcgccctctc atagctagtc tcatcttttc aaatcctgaa 240 

35 gagcaaaaat ggtctaatca aattcaaggg gagattatcc gtgaggaact ggctactttg 300 
agagaacagt tggctcagac agaagagatt ttcttcatgg atattcccct actttttgaa 360 
caggactaca gcgattggtt tgctgagact tggttggtct atgtggaccg agatgcccaa 420 
gtagaacgct . taatgaaaag ggaccagttg tccaaagatg aagctgagtc tcgtatggca 480 
gcccagtggc cttta.gaaaa aaagaaagat ttggccagcc aggttcttga taataatggc 54 0 

40 aatcagaacc agcttcttaa tcaagtgcat atccttcttg agggaggtag gcaagatgac 600 
agagattaa 609 

<210> 43 

<211> 1260 

45 <212> DNA 

<213> Streptococcus pneumoniae 



. <400> 43 
atgagaaaaa ttgttatcaa tggtggatta 

50 gctaaaaata gtgttgtggc cttaattcca 
ttggattgtg ttccagatat ttcggatgta 
ggagctactg ttaagcgtta tgacgatgtc 
attccaatgc cttatggtaa aattaacagt 
ctcttaggcc gttttggtga. agcgacagtt 

55 cgtccgattg acttacacct taaggcgttt 
ggagataaca tgaagttatc tgctaaagat 
gatacggtta gtgtgggagc aacgattaat 



ccactgcaag gtgaaattac tattagtggt 60 
gctattatat tggctgatga tgtggtgact 120 
gccagtcttg tcgaaatcat ggaattgatg 180 
ttggagattg atccaagagg tgttcaaaat 240 
cttcgtgcat cttactattt ttatgggagc 300 
ggtctaccgg gaggatgtga tcttggtcct 360 
gaagctatgg gtgccactgc tagctacgag 420 
acaggacttc atggtgcaag tatttacatg 480 
acgatgattg ctgcagttaa agcaaatggt 540 
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cgtactatta ttgaaaatgc agcccgtgaa cctgagatta ttgatgtagc tactctcttg 600 
aataatatgg gtgcccatat ccgtggggca ggaactaata tcatcattat tgatggtgtt 660 
gaaagattac atgggacacg tcatcaggtg attccagacc gcattgaagc tggaacatat 720 
atatctttag ctgctgcagt tggtaaagga attcgtataa ataatgttct ttacgaacac 780 
5 ctggaagggt ttattgctaa gttggaagaa atgggagtga gaatgactgt atctgaagac 840 
agcatttttg tcgaggaaca gtctaatttg aaagcaatca atattaagac agctccttac 900 
ccaggctttg caactgattt gcaacaaccg cttacccctc ttttactaag agcgaatggt 960 
cgtggtacaa ttgtcgatac gatttacgaa aaacgtgtaa atcatgtttt tgaactagca 1020 
aagatggatg cggatatttc gacaacaaat ggtcatattt tgtacacggg tggacgtgat 1080 
10 ttacgtgggg ccagtgttaa agcgaccgac ttaagagctg gggctgcact agtcattgct 1140 
gggcttatgg ctgaaggtaa aactgaaatt accaatattg agtttatctt acgtggttat 1200 
tctgatatta tcgaaaaatt acgtaattta ggagcggata ttagacttgt tgaggattaa 1260 

<210> 44 
15 <211> 696 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 44 

20 atgtcaagaa ttgaattttc accatctttg atgaccatgg atttggacaa attcaaagag 60 

cagattactt ttttgaatga taaagtagca tcttatcata tcgatattat ggatggccat 120 

tttgttccca afcattacctt gtctccttgg ttcattcaag aagttcaaaa aattagtgac 180 

acacctttat cagttcatct gatggtcaca gacccaacct tttgggtaga tcaagttctc 240 

gatttacaat gtgagtatat ttgtattcat gctgaagttc tgaatggtct tgcttttcgt 300 

25 ttgattgata aaattcatga tgcaggtcta aaggctggtg ttgtccttaa tcctgaaaca 360 

cctgtttcta caatctttcc ctacattgat ttacttgaca aagtaactat tatgactgta 420 

gatccaggtt ttgcaggaca acgctttttg gagtctacct tgtataaaat ccaagaactc 4 80 

cgtcagctta gagttcagaa tggttatcac tacatcattg agatggatgg ttcttcgagt 54 0 

cgtaagactt tcaaacaaat tgatgtggca ggaccagata tttatgttat aggtcgcagt 600 

30 ggattatttg gtttggatga cgatattgcc aaagcctggg atatctgttc tagagattac 660 
gaagaaatga ccggaaaaac aatgccaatc aaataa 696 

<210> 45 

<2il> 1125 

35 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 45 

atgagaaata tggctttgac agcaggtatc gttggtttgc caaacgttgg taaatcaaca 60 

40 ctatttaatg caattacaaa agcaggagca gaggcagcaa actacccatt tgcgactatt 120 
gatccaaatg ttggaatggt ggaa'gatcca gatgaacgcc tacaaaaact aactgaaatg 180 
ataactccta aaaagacagt tcccacaaca tttgaattta cggatattgc agggattgta 240 
aaaggagctt caaaaggaga agggctaggg aataaattct tggccaatat tcgtgaagta 300 
gatgcgattg ttcacgtagt tcgtgctttt gatgatgaaa atgtgatgcg cgagcaagga 360 

45 cgtgaagacg cctttgtaga tccacttgca gatattgata caattaatct ggaattaatt 420 
cttgctgact tagaatcagt gaacaaacga tatgcgcgtg tagaaaagat ggcacgtacg 480 
caaaaagata aagaatcagt agcagaattc aatgttcttc aaaagattaa accagtccta 540 
gaagacggga aatcagctcg taccattgaa tttacagatg aggaacaaaa ggttgtcaaa 600 
ggtcttttcc ttttgacgac taaaccagtt ctttatgtag ctaatgtgga cgaggatgtg 660 

50 gtttcagaac ctgactctat cgactatgtc aaacaaattc gtgaatttgc agcgacagaa 720 
aatgctgaag tagtcgttat ttctgcgcgt gctgaggaag aaatttctga attggatgat 780 
gaagataaaa aagagtttct tgaagccatt ggtttgacag aatcaggtgt agataagttg 840 
acgcgtgcag cttaccactt gcttggattg ggaacttact tcacagctgg tgaaaaagaa 900 
gttcgcgctt ggactttcaa acgtggtatg aaggctcctc aagcagctgg tattatccac 960 

55 tcagactttg aaaaaggctt tattcgtgca gtaaccatgt catatgaaga tctagtgaaa 1020 
tacggatctg aaaaggccgt aaaagaagct ggacgcttgc gtgaagaagg aaaagaatat 1080 
atcgttcaag atggcgatat catggaattc cgctttaatg tctaa 1125 
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<210> 46 
<211> 333 
.<212> DNA 
5 <213> Streptococcus pneumoniae 



<400> 46 

atggaaatcg aaaaaaccaa tcgtatgaat 
acagataagc aaatgaatta tatagagctt 
10 atagctgagg agtttggtgt tagtcgtcag 
aagattctgg aagattatga gatgaaattg 
cagatttttg accaaatctt ggagcgctat 
"gaaattttaa caagcattga taatagagaa 

15 <210> 47 
<211> 672 
<212> DNA 

<213> Streptococcus pneumoniae 
20 <400> 47 

atgaccttag aatgggaaga atttctagat ccttacattc aagctgttgg tgagttaaag 60 

attaaacttc gtggtattcg taagcaatat cgtaagcaaa ataagcattc tccaattgag 120 

tttgtgaccg gtcgagtcaa gccaattgag agcatcaaag aaaaaatggc tcgtcgtggc 180 

attacttatg cgaccttgga acacgatttg caggatattg ctggcttacg tgtgatggtt 24 0 

25 cagtttgtag atgacgtcaa ggaagtagtg gatattttgc acaagcgtca ggatatgcga 300 

atcatacagg agcgagatta cattactcat agaaaagcat caggctatcg ttcctatcat 360 

gtggtagtag aatatacggt tgataccatc aatggagcta agactatttt ggcagaaatt 420 

caaattcgta ctttggccat gaatttctgg gcaacgatag aacattctct caactacaag 480 

taccaagggg atttcccaga tgagattaag aagcgactgg aaattacagc tagaatcgcc 540 

30 catcagttgg atgaagaaat gggtgaaatt cgtgatgata tccaagaagc ccaggcactt 600 

tttgatcctt tgagtagaaa attaaatgac ggtgtaggaa acagtgacga tacagatgaa 660 

gaatacaggt aa 672 



gcgctctttg aattttatgc ggcgcttttg 60 

tactacgctg atgattacag tcttgctgag 120 

gctgtctatg acaatatcaa gcgaacagaa 180 

cacatgtact cggactacat tgtccgtagt 24 0 

cccaaggatg attttctgca ggagcagata 300 
taa 333 



<210> 48 
35 <211> 588 
<212> DNA 

<213> Streptococcus pneumoniae 
<4O0> 48 

40 atggaactta atacacacaa tgctgaaatc ttgctcagtg cagctaataa gtcccactat 
ccgcaggatg aactgccaga gattgcccta gcagggcgtt caaatgttgg taaatccagc 
tttatcaaca ctatgttgaa ccgtaagaat ctcgctcgta catcaggaaa acctggtaaa 
acccagctcc tgaacttttt taacattgat gacaagatgc gctttgtgga tgtgcctggt 
tatggctatg ctcgtgtttc taaaaaggaa cgtgaaaagt gggggtgcat gattgaggag 

45 tacttaacga ctcgggaaaa tctccgtgcg gttgtcagtc tagttgacct tcgtcatgac 
ccgtcagcag atgatgtgca gatgtacgaa tttctcaagt attatgagat tccagtcatc 
attgtggcga ccaaggcgga caagattcct cgtggtaaat ggaacaagca tgaatcagca 
atcaaaaaga aattaaactt tgacccaagt gacgatttca tcctcttttc atctgtcagc 
aaggcaggga tggatgaggc ttgggatgca atcttagaaa aattgtga 

50 

<210> 49 
<211> 294 
<212> DNA 

<213> Streptococcus pneumoniae 

55 

<400> 49 

atgaaaacaa gaaaaatccc tttgcgcaag tctgttgtgt ctaacgaagt gattgataag 60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
588 
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10 



15 



cgtgatttgc tccgcattgt taagaacaag 
aaggccaatg gccgcggcgc ttatatcaaa 
aagaagaagg tctttaaccg cagctttagc 
ttgatcgctt atgtggatca caaagtgaaa 

<210> 50 
<211> 312 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 50 

atgttaaaac cctctattga taccttgctc 
atcttggaag caaaacgtgc ccacgaattg 
aagtctgaaa aatcaactct tcgcgcttta 
cacccagatc cagaaggaaa acgtgaagca 
cgcaaagaag aagaagaaaa gaaaatcaaa 
gaaaaaattt aa 



gaaggacaag tctttattga tcctacgggc 120 

ctagacaatg cagaagccct agaggcgaaa 180 

atggaagtgg aagaaagctt ttatgacgag 240 

agaagagagt tgggacttga ataa 294 



gacaaggttc cttcaaaata ttcactcgta 60 
gaagcaggtg ccccagcaac tcaaggtttc 120 
gaagaaatcg aatcaggaaa cgttacaatt 180 
gtgcgtcgcc gtatcgaaga agaaaaacgc 240 
gagcaaattg ctaaagaaaa agaagatggt 300 

312 



<210> 51 
20 <211> 312 
<212> DN£ 

<213> Streptococcus pneumoniae 



<400> 51 
25 atgtcattaa 
atcatccaaa 
cttgatgcgc 
gaagtagctg 
atcttgattt 
30 aaagaaatct 



catcaaaaca 
tcgggaaaaa 
gtgaattaat 
aaattttgga 
tgtttaaaca 
aa 



acgtgccttc ctcaacagcc aggcacacac cctcaaacct 60 
tggactcaac gaccaaatca aaaccagcgt ccgtcaagct 120 
caaggttact ctcttacaaa acacagatga aaacatccac 180 
agaagaaatc ggtgtggata cagtccaaaa aataggacgc 240 
atctagcaag aaagaaaatc gcaagatttc taagaaagtc 300 

312 



<210> 52 
<211> 528 
<212> DNA 
35 <213> Streptococcus pneumoniae 



40 



45 



<400> 52 

atggcgattg 

ccaagcctgc 

gcttggaaca 

gcgggtattg 

gagaaatttg- 

cgtgctatga 

atgacagata 

gtccaacatg 

aaaatcactg 



aaaattatat 
aggcgcaggg 
accctgatgg 
gcattatcgt 
ggattgatta 
aggaattcca 
tacgagcagc 
actcaatcaa 
aaaagtacgg 



accagatttt 
aatcaaggct 
aacgccagag 
agtgtcaaat 
cgtttactgg 
ctatgacaaa 
ccaccgtgca 
aacgcagatt 
accgattaca 



gctgtggaag 
gttttggtcg 
atgaagcaat 
aacaccaaaa 
gccttgaagc 
aaggaagtgg 
gggattcggt 
aaccgaactc 
tataaaaaag 



cagtctatga 
atttggataa 
ggctacatga 
aacgcgttca 
ccttcacatt 
tcatggttgg 
caattttagt 
gtgagcgtcg 
gaatttaa 



tctgacagtc 60 
taccctcatt 120 
ccttcgggac 180 
acgagcagtt 24 0 
tggtattgac 300 
tgaccagctc 360 
caaacccttg 420 
tgttatgaga 480 
528 



<210> 53 
<211> 1368 
50 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 53 

atgtttcgaa aaattttaat tgccaatcgt ggtgaaattg cggttcgtat tatccgtgcg 60 
55 gcacgtgaat tggggattgc gacggtagcg gtttattcaa ctgctgataa ggaagctctt 120 
catacgctgt tggcagatga agcagtttgt attggtcctg gcaaggcaac agagtcttat 180 
ctcaatatta atgcagttct atcagctgca gtcttgactg aggcagaagc tattcaccct 240 
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ggttttggat ttctcagtga aaattccaaa 
aagtttatcg gtccatctgg tcatgttatg 
gctcagatga ttaaagcagg tgtgcctgtt 
tctgaagaag ctttgattgt tgctgaaaaa 
5 ' gcaggtggag gtggtaaagg gattcgtaag 
tttgaaactg cctctagtga ggccaaggcc 
cgggttatct atccagctcg gcacattgag 
gtgattcact tgggtgaacg ggattgttct 
gaaagtccct cgattgcaat cggaaaaacg 

10 cgagcggcag agtttgttgg ctatgagaat 
gcaagtagca atttctattt catggagatg 
acagagtttg tttcaggtgt tgatatcgtt 
cctttgtctg ttaagcaaga agatattgtc 
aatgcagaaa acccagcctt taactttgct 

15 ctgccaagtg gtggagttgg cttgcgcgtg 
ccgccttatt atgatagtat gattgccaaa 
gccttgatga aaatgcaacg tgccctctat 
gcagatttcc agcttgacct catttcagat 
tgcttcttga tggaaacctt cttacctaaa 

20 

<210> 54 
<211> 234 
<212> DNA 

<213> Streptococcus pneumoniae 

25 

<400> 54 

atgatttaca aagtttttta tcaagaaaca 
cgcgcgcttt acctagacat egataccagc 
caacttgtcg aagaaaatcg cccagagtac 
30 aaattgctcg attacgaaaa agaaactggc 

<210> 55 

<211>. 1011 

<212> DNA 

35 <213> Streptococcus pneumoniae 

<400> 55 

atgaaggata gatatatttt agcatttgag 
ttgaaaaacg acgatgagct cttgtccaat 

40 cgttttggtg gcgtagtgcc cgaagtagcc 
tgtatcgagg aggcattggc agaagcaggg 
gttacctacg gaccaggctt ggtcggagcc 
tttgcttggg ctcacggact tccactgatt 
gcagctcaga gtgtggagcc tttggagttt 

45 cacacagagt tggtctatgt ttctgaggct 
gatgacgcgg ttggcgaggc ctatgataag 
gcaggtcgtg agattgacga gctggctcat 
gccatgatta aggaagataa tctggagttt 
aatcttcatc acaatgccga gcaaaaggga 

50 tccttccaag cagcagttat ggacattctc 
tatcctgtta aaaccctatt tgtggcaggt 
cgcttagcag ccgaaatcac agatgtcaag 
gacaatgcag gtatgattgc ctatgccagc 
ggctgggacc tcaatgccaa accaagtctt 

55 

<210> 56 
<211> 1809 



tttgcgacca tgtgtgaaga aataggtatc 300 
gatatgatgg gggataaaat caatgcgcgt 360 
ataccaggtt cagatggaga agtgcataac 420 
attggctatc ctgttatgct caaggcttca 480 
gttgaaaaac cagatgacct cgtttctgcc 540 
aattatggca atggtgccat gtacatagaa 600 
gttcaaatcc taggtgatga gcatggacat 660 
cttcaaagga ataaccaaaa ggttttggaa 720 
ctgcgtcatg aaataggtgc tgctgctgtt 780 
gcaggaacca ttgaatttct tcttgatgaa 84 0 
aatactcgtg ttcaggtaga acatccagta 900 
aaggaacaga tttgcattgc ggcaggtcag 960 
ctacgcggtc atgccatcga gtgtcgtatc 1020 
ccaagtccag gtaagattac taatctctat 1080 
gattcagcag tttatccagg ttataccatt 1140 
atcatagtac acggcgaaaa tcgttttgac 1200 
gaattagaaa ttgaaggagt gcagaccaat 1260 
cgcaatgtca ttgctgggga ttatgatact 1320 
tatcaagaaa aagaataa 1368 



aaagaacgta gcccacgccg tgaaacaaca 60 
tcagaacttg agggccgtat cactgctcgc 120 
aatatcgaat atatcgaact cttgtctgac 180 
gccttcgaaa ttacggagtt ctaa 234 



acatcctgtg atgagaccag tgtcgccgtc 60 
gtcattgcta gtcaaattga gagtcacaaa 120 
agtcgtcacc atgtcgaggt cattacagcc 180 
attaccgaag aggacgtgac agctgttgcg 24 0 
ttgctagttg gtttgtcagc tgccaaggcc 300 
cctgttaatc acatggctgg gcacctcatg 360 
cccttgctag ccctcttggt cagcggcgga 420 
ggcgattaca agattgttgg ggagacacga 480 
gtcggccgtg tcatgggctt gacctatcct 540 
caggggcagg atatttatga tttcccccgt 600 
tcattctctg gtttgaagtc agcctttatc 660 
gaaagcctgt ctacagaaga tttgtgtgct 720 
atggcaaaaa ccaagaaggc tttggaggaa 780 
ggtgtggcag ccaataaagg tctcagagaa 84 0 
gttatcatcc cccctctgcg actctgcgga 900 
gtcagcgagt ggaacaaaga aaacttcgca 960 
gcctttgata ccatggaata a 1011 
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<212> DNA 

213> Streptococcus pneumoniae 
<400> 56 

'5 atgtgtggaa ttgttggtgt tgttggaaac 
cttgaaaagc ttgaataccg tggctatgat 
gataaccatt tggtgaaggc ggttggtcgt 
gttgagggaa caactggtat cggacatact 
gacaatgctc acccacaccg ctctgagaca 

10 attgagaact atcttgaaat caaggaagaa 
acagatactg aaatagccgt tcatttgatt 
gttcttgaag cctttaaaaa agctcttcat 
attgactctg aaaatccaga tgtcatctat 
ggtcttgggg aaggctacaa tatggtctgc 

15 aaccaataca tggaaattca tgaccaagag 
gttcaagact atgatggtaa cagtcgtgaa 
tcagatatcg gtaagggaac ttatccttac 
actgttatgc gtaaactcat tcaagcctac 
cctgctatca ttaaggctgt tcaagacgca 

20 tcttaccatg caggatttgc ttctaagaaa 
gaacttggaa tctcatctga gtggggctac 
ttcatcttta tcagccaatc tggtgaaaca 
aatgaaatgg gaattccaag cttaacagtg 
gaagccaact ataccatgct ccttcacgca 

25 gcctatacag cgcaaatcgc agcccttgcc 
ggtaatgcta aagcgcaagc ctttgacctg 
attgaatcaa ctctttcaga gaaagaaacc 
acaactcgta acgcctttta catcggacgt 
agtctcaaac tcaaagagat ttcttatatc 

30 aagcacggaa ccattgcctt gattgaagaa 
ccagttcttg ccaaccatac tcgtggaaat 
gtcctcacta tcgcagaaga gaatgtagcc 
gtacatccat acctctcacc aatttcaatg 
gcaaccctcc accgtggcct cgatgtggac 

35 gtagaataa 

<210> 57 
<211> 723 
<212> DNA 
40 <213> Streptococcus pneumoniae 

<400> 57 

atgatacgta tcgaaaatct cagtgtctcc 
tcactagtgc tccatggacc aacaattacc 

45 tcaacactat taaaaggtat gctgggaatt 
gacaaggaag ttaaaaaatc cttacaccga 
gactacaact ttcccatcaa ggtcaaggaa 
cctctctttc gaagtttaaa ggctaaacat 
gtcggcctag ctgactacgc tgaacgtcaa 

50 cgggtcttga ttgceagatg tttggtgcag 
tttgctggga ttgactctgt cagtgaggaa 
aaagctggga agacggttct catcgttcac 
gatcaagtct tacttgtcaa tcgagaagtg 
accgaaacca atctaaaaga agcttacggt 

55 tga 

<210> 58 



acaaatgcaa ctgatatttt gattcaaggg 60 
tctgcgggaa tttttgtcct agatggtgct 120 
attgcagaat tgtctgccaa gacagctggt 180 
cgttgggcta ctcacggaaa accaactgag 240 
gaacgttttg tcttggtgca taatggggtg 300 
taccttgcag gtcaccactt caaggggcag 360 
ggaaaatttg cggaagaaga agggctctca 420 
attatccgtg gttcatatgc ctttgccttg 480 
gtagcgaaaa acaaatctcc acttttgatt 540 
tcagatgcta tggctatgat tcgtgaaacc 600 
ttggtaatcg tcaaggctga tagcgtggaa 660 
cgtgctagct atactgcgga acttgacttg 720 
tacatgctta aggaaattga tgagcaacca 780 
acggatgatg ctggtcaagt agtggttgct 840 
gaccgcatct acatccttgc agctggaaca 900 
atgttggaag aattgacaga tacaccagtt 960 
ggtatgccac ttctcagcaa gaaaccactc 1020 
gcggatagtc gtcaagtttt ggtcaaggct 1080 
acaaatgttc caggttcaac cctctcacgt 1140 
ggacctgaaa ttgccgtggc atcaactaaa 1200 
ttccttgcaa aagcagtcgg agaagcaaat 1260 
gttcatgaat tgtcaatcgt agctcagtct 1320 
attgaagcca aggttcgtga acttcttgaa 1380 
ggtcaagatt actacgtagc catggaagca 14 40 
cagtgtgaag gttttgcggc aggagaactc 1500 
ggaacgcctg tt.ttggctct cttgtcagat 1560 
atccaagagg tcgcagcccg tggtgccaaa 1620 
aaagataccg acgatatcgt ccttacgacc 1680 
gtcgtaccaa cgcaattagt cgcttacttt 1740 
aaaccacgta accttgccaa gtcagtaacg 1800 

1809 



tacaaagaaa cgttggcact taaggatatt 60 
ggcatcattg gtccaaacgg cgctgggaaa 120 
atcccacatc aaggtcaggc atttctcgat 180 
attgcctatg tcgaacaaaa aatcaatatc 240 
tgcgtctcgt taggactatt tccctctatt 300 
tggaagaaag tgcaagaggc ccttgaaatc 360 
attagtcaac tgtctggagg tcaattccag 4 20 
gaagccgact atatcctctt ggatgaaccc 4 80 
atca'tcatga atacgctgag agatttgaaa 540 
cacgacctca gcaagattcc ccactacttc 600 
attgcctttg gtccaacaaa agaaactttt .660 
aatcaactct ttttcaatgg aggtgaccta 720 

723 
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<211> 2223 
<212> DNA 

<213> Streptococcus pneumoniae 

5 <400> 58 

atgccgaaag aagtgaattt aacaggcgaa 
acggaagagg atgttcattt tgtccataag 
ggtcaatatc gcaaatcagg cgagccttat 
ttagctaagc taaagctgga tgctgtaaca 
10 gaagatacag atgcgacctt ggacgatttg 
attgttgacg gagttaccaa gcttggcaag 
gcggaaaatc atcgcaagat gctcatggcc 
aaactgtctg accgcttgca caatatgc'gg 
gagcgtattt ccaaagaaac catggaaatc 

15 tccagtgtca aatgggaatt agaagacttg 
tacaagatta cccatatgat gaaggaaaag 
gtagtcacaa aattagagga gtatacgaca 
cgtcccaagc atatttactc aattttccgc 
gaaatctatg atctgattgc tattcgttgt 

20 atgcttggtt acgtgcatga attttggaaa 
gccaaccgca aggccaatgg ttatcagtct 
ccgattgaat tccagattcg aaccaaggaa 
gctcactggg cttataagaa aggtataaag 
ggaatgaact ggatcaagga gatgatggag 

25 tttgtggact ctgttaagga aaactatctg 
ggagctgtcc gttcccttcc caaagattca 
accaaggtcg gtgaaaaagc aactggtgcc 
accaagttaa agacagggga tcaggttgaa 
agccgtgact ggctcaatat ggtcaagact 

30 tttaaaaacc aagataagga attgtctgtc 
ttccaagaaa atggctatgt ggcaaataaa 
ctgcaaaaga ccagttacaa gacagaagac 
atcggtgcga ttaccgtctt taaccgtctg 
gccaaggcca aggctgaggc agaggagctt 

35 aaagaaactc tcaaggtcaa gcatgagggg 
ctagtgcgga ttgctaagtg ttgtaacccc 
accaagggtc gtggtgtggc tattcaccgt 
aactacgagc aacgtctcct tgatgtggaa 
gagtatctgg cccatatcga tatctacggt 

40 ctgcaagttc tttcaaatac aaccaagaat 
gatatgaagt ttgctaatat ccatgtgtcc 
acggttgtcg ataaaattaa gagtgtgcca 
tag 

45 <210> 59 . 
<211> 1479 
<212> DNA 

<213> Streptococcus pneumoniae 

50 <400> 59 

atgtctaatt gggacactaa atttttgaaa 
attccagctg aaagtcatgt gttgcctaac 
aatttgactt taaatatccc aattattacc 
atggccattg ctattgctcg tgcaggcggt 

55 gctcaacaag cagacgaggt tcgtaaggta 
ccgttcttct tgacgcctga acatacaatt 
cgcatcagtg gtgttccagt tgttgaaaca 



gaagttgtcg ctttaaccaa agaatattta 60 
gccttggtct atgctgttga atgccacagt 120 
atcattcacc ctatccaagt ggcaggtatt 180 
gtagcttgtg gattcttgca tgatgtggtg 240 
gaaagagagt ttggtcctga tgtgcgggtg 300 
gtcgagtaca aatcgatcga ggagcaatta 360 
atgtctgagg acatccgcgt tattttggtc 420 
accctgaaac atcttcgaaa agacaagcag 480 
tatgccccac ttgcccatcg tttggggatt 540 
tctttccgtt atctcaatcc aacggagttt 600 
cgcagggagc gtgaggcctt ggtggatgag 660 
gaacgtcact tgaaagggaa gatttatggt 720 
aaaatgcagg acaagagaaa acggtttgag 780 
attttagata cccaaagtga tgtttatgcc 84 0 
ccgatgccag gtcgcttcaa agactatatt 900 
atccatacga ctgtttatgg accaaaaggg 960 
atgcacgagg tggctgagta cggggttgcg 1020 
gggcaagtta acagcaagga atcagct'att 1080 
ctccaagacc aggctgatga tgctaaggaa 1140 
gctgaggaga tttacgtttt taccccagat 1200 
ggaccgattg attttgccta cgaaatccat 1260 
aaggtcaatg gccgcatggt tccactgaca 1320 
attatcgcca acccgaactc ctttggacct 1380 
agcaaggcgc gcaataagat tcgccagttc 14 40 
aacaagggtc gtgagatgct gatggctcag 1500 
tttatggaca agcgccacat ggatcaagtt 1560 
tccctctttg cggccattgg ttttggggaa 1620 
actgaaaagg aacgccgtga ggaagagcgt 1680 
gtcaaaggtg gcgaggtcaa ggttgaaaat 17 40 
ggagtggtta ttgaaggtgc ttctggtctc 1800 
gtgcctggtg acgatattgt tggctacatt. 1860 
gtggactgta tgaacctgcg tgcccaagaa 1920 
tgggaagacc agtactctag ctcaaataag 1980 
ctcaaccgta caggactgtt gaacgatgta 2040 
atttcaacgg tcaatgccca accaaccaag 2100 
ttcggtattg ccaacctctc tacactgacc 2160 
gaagtttact c.tgtcaaacg gaccaacggc 2220 

2223 



aaaggtttta cctttgatga tgtattgctt 60 

gatgcagatt taacaactaa attggcagat 120 

gctgccatgg acacagttac agagagtcaa 180 

ctcggagtta tccataaaaa catgtcaatt 24 0 

aaacgttctg aaaatggagt tattattgat 300 

gctgaagcag atgagcttat gggtcgttac 360 

cttgaaaatc gtaaattggt tggtattttg 420 



23 



WO 01/49721 



PCT/US00/35604 



acaaaccgag atcttcgttt tatttcagat tataatcaac caatttcaaa ccatatgact 480 
agtgaaaatc ttgttactgc tcctgtgggt acggatcttg caacggctga gagtattctt 540 
caagagcatc gtattgaaaa acttccgttg gtcgatgaag aaggcagtct ttctggtttg 600 
atcactatca aagatattga aaaagttatt gagtttccaa atgcggctaa agatgagttt 660 
5 ggtcgtcttc tagttgcagg tgcagtaggt gttacttcag atacatttga acgtgcagag 720 
gctctttttg aggcaggagc ggatgcgatt gttattgata ctgcacatgg tcattctgca 780 
ggtgtcttgc gtaaaattgc cgagattcgt gctcatttcc cagatcggac tttgattgct 840 
ggaaatattg ctactgctga aggtgcacgt gccctttatg aagcgggtgt agacgttgtt 900 
aaggttggta ttggaccagg ttctatctgt actactcgtg tgattgctgg tgttggtgtt 960 

10 ccgcaagtaa cagctatcta cgatgctgca gctgttgcgc gcgaatatgg taaaacgatt 1020 
attgctgacg gtgggatcaa gtattctgga gatattgtaa aagcacttgc tgcaggtgga 1080 
aatgctgtta tgcttggatc tatgtttgct ggaactgatg aagctccagg cgaaactgaa 1140 
atcttccaag gacgtaaatt caagacttac cgtggtatgg gatcaattgc tgctatgaag 1200 
aaaggttcaa gcgaccgtta tttccaaggt tctgtcaatg aagcaaacaa gcttgttcca 1260 

15 gaaggaattg aaggtcgtgt tgcttataaa ggagcggcag ctgatattgt tttccaaatg 1320 
attggtggta ttcgctctgg tatgggttac tgtggtgcag ctaaccttaa agaactacac 1380 
gataatgctc aatttattga aatgtctggt gctggtttga aagaaagcca tcctcatgat 14 40 
gtgcaaatta ctaatgaggc accaaattat tctatgtaa 14 79 

20 <210> 60 

<211> 1947 
<212> DNA 

<213> Streptococcus pneumoniae 
25 <400> 60 

atgacagaag aaatcaaaaa tctgcaggca caggattatg atgccagtca aattcaagtt 60 
ttagagggct tagaggctgt tcgtatgcgt ccagggatgt acattggatc aacctcaaaa 120 
gaaggtcttc accatctagt ctgggaaatt gttgataact caattgacga ggccttggca 180 
ggatttgcca gccatattca agtttttatt gagccagatg attcgattac tgttgtggat 24 0 

30 gatgggcgtg gtatcccagt cgatattcag gaaaaaacag gccgtcctgc tgttgagacc 300 
gtctttacag tccttcacgc tggaggaaag ttcggcggtg gtggatacaa ggtttcaggt 360 
ggtcttcacg gggtggggtc gtcagtagtt aatgcccttt ccactcaatt agacgttcat 420 
gttcacaaaa atggtaagat tcattaccaa gaataccgtc gtggtcatgt tgtcgcagat 480 
cttgaaatag ttggagatac ggataaaaca ggaacaactg ttcacttcac accggaccca 54 0 

35 aaaatcttca ctgaaacaac aatctttgat tttgataaat taaataaacg gattcaagag 600 
ttggcctttc taaatcgcgg tcttcaaatt tcaattacag ataagcgcca aggtttggaa 660 
caaaccaagc attatcatta tgaaggtggg attgctagtt acgttgaata tatcaacgag 720 
aacaaggatg taatctttga tacaccaatc tatacagacg gtgagatgga tgatatcaca 780 
gttgaggtag ccatgcagta cacaactggt taccatgaaa atgtcatgag tttcgccaat 84 0 

40* aatattcata cacatgaagg tggaacgcat gaacaaggtt tccgtacagc cttgacacgt 900 
gttatcaatg attatgctcg taaaaataag ttactgaaag acaatgaaga caacctaaca 960 
ggggaagatg ttcgcgaagg cttaactgca gttatctcag ttaaacaccc aaatccacag 1020 
tttgaaggac aaaccaagac caaattggga aatagcgaag tggtcaagat taccaatcgc 1080 
ctcttcagtg aagccttctc cgatttcctc atggaaaatc cacagattgc caaacgtatc 1140 

45 gtggaaaaag ggattttagc tgccaaggct cgtgtggctg ccaagcgtgc gcgtgaagtc 1200 
acacgtaaaa aatctggttt ggaaatttcc aaccttccag ggaaactagc agactgttct 1260 
tctaataacc ctgctgaaac agaactcttc atcgtcgaag gagactcagc tggtggatca 1320 
gccaaatctg gtcgtaaccg tgagtttcag gctatccttc caattcgcgg taagattttg 1380 
aacgttgaaa aagcaagtat ggataagatt ctagctaacg aagaaattcg tagtcttttc 1440 

50 acagccatgg gaacaggatt tggcgcagaa tttgatgttt cgaaagcccg ttaccaaaaa 1500 
ctcgttttga tgaccgatgc cgatgtcgat ggagcccaca ttcgtaccct tcttttaacc 1560 
ttgatttatc gttatatgaa accaatccta gaagctggtt atgtttatat tgcccaacca 1620 
ccaatctatg gtgtcaaggt tggaagcgag attaaagaat atatccagcc gggtgcagat 1680 
caagaaatca aactccaaga agctttagcc cgttatagtg aaggtcgtac caaaccgact 1740 

55 attcagcgtt ataaggggct aggtgaaatg • gacgatcatc agctgtggga aacaaccatg 1800 
gatcccgaac atcgcttgat ggctagagtt tctgtagatg atgctgcaga agcagataaa 1860 
atctttgata tgttgatggg ggatcgagta gagcctcgtc gtgagtttat cgaagaaaat 1920 



24 



WO 01/49721 



PCT/USOO/35604 



gctgtctata gtacacttga tgtctaa 1947 

<210> 61 
<211> 267 
5 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 61 

atgggattta ctgaagaaac agtacgtttt aaattggacg attccaataa aaaagaaatt" 60 

10* agcgaaactt tgacagatgt ttatgcttcg ttgaacgata agggttacaa cccaattaac 120 

caaatcgtag gttacgtatt gagtggagac cctgcctacg ttcctcgtta taataatgca 180 

cgaaatcaaa tccgtaagta tgagcgtgat gaaatcgttg aggaattggt tcgctactac 240 

ctcaaaggac aaggagtcga tctataa 267 

15 <210> 62 
<211> 597 
<212> DNA 

<213> Streptococcus pneumoniae 
20 <400> 62. 

atggtcaact atccacataa agtttcatca caaaaaagac aaacatctct ttctcaaccc 60 
aaaaatttcg caaatcgagg aatgtctttt gaaaagatga tcaatgctac caacgactac 120 
tatttgtctc agggcttggc tgttatacat aagaaaccaa ctcctattca aatcgtacaa 180 
gtggactatc cacaacgaag tcgtgccaag attgttgaag cctattttcg acaagcttca 240 

25 acgacggact attctggcgt ttataatgga tattacatcg actttgaagt- caaggaaaca 300 
aaacaaaaac gtgcgattcc gatgaaaaat tttcatccac atcagattca gcatatggaa 360 
caagtccttg cccaacaagg aatctgcttt gtccttcttc acttttcttc tcagcaagaa 420 
acctacttat tgccggcatt cgatttgatt cgcttctatc atcaagataa gggacaaaaa 480 
tcaatgccac ttgaatatat tcgagaatat ggatatgaaa tcaaggctgg tgccttccct 540 

30 caaattcctt atctcaatgt tatcaaagaa catttattag gtggtaaaac aagatga 597 

<210> 63 

<211> 867 

<212> DNA 

35 <213> Streptococcus pneumoniae 



<400> 63 

atggctctat ttagtaaaaa agataagtat attcgaatca atcccaatcg ttcggttagg 60 
gaaaaacctc aagctaagcc agaggttcca gatgaattat tttcccagtg tccaggctgt 120 

40 aagcatacca tctatcagaa ggatctggga agtgaacgta tctgtccgca ctgtagctat 180 
acctttcgta tttctgccca agaacgcttg. gctttgacga ttgatatggg aaccttcaaa 240 
gaattgttta cagggattga aagcaaggat cccttgcatt tccctggtta ccaaaagaaa 300 
ctggcatcta tgcgtgaaaa aacaggtctg catgaagccg ttgtgacagg aactgctctt 360 
attaaaggtc agactgtggc tcttgggatt atggattcta actttatcat ggcttctatg 420 

45 ggtacggttg taggtgaaaa aatcactcgt ttgtttgagt atgcgactgt cgaaaaattg 480 
ccagttgtcc tattcacagc ctctggtgga gcccgtatgc aggaaggaat catgagtctc 540 
atgcagatgg ptaagatctc tgcggcggtt aaacgccatt caaatgctgg tctcttttac 600 
ctgaccattt tgacagatcc aacgactggt ggtgtgacag cttctttcgc tatggaaggc 660 
gatatcattc tggctgaacc acagagcttg gttggttttg ctggacgtcg tgtgattgaa 720 

50 aatacggttc gtgaaagctt gcctgaggat ttccaaaagg cagaattcct attagaacat 780 
ggctttgtgg atgctattgt caaaagaaga gacttaccag atacgattgc tagcctagtc 840 
agattgcatg gagggagtcc tagatga 867 

<210> 64 
55 <211> 420 
<212> DNA 

<213> Streptococcus pneumoniae 
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<400> 64 

atgagaatta tgggattgga cgtcggttca aaaacggtag gggtggcgat tagcgatccg 60 
cttggtttta cagctcaagg gcttgaaatc atccagataa atgaagaaca aggccaattt 120 
5 ggttctgacc gcgttaagga attggttgat acttacaagg tggaacgatt tgtagtgggc 180 
ttgcctaaaa acatgaacaa tacaagtgga ccgcgcgtag aagctagtca agcatacgga 24 0 
gcaaagctag aagagttttt tggtttacca gtagactatc aggatgaacg cttgacaaca 300 
gtggctgctg agcgcatgtt gattgaacaa gcagatatca gt-cgcaataa gcgcaagaaa 360 
gtcattgata agttagcagc tcagctgatt ttacaaaatt atttagatag aaaattttaa 420 

10 

<210> 65 
<211> 1197 
<212> DNA 

<213> Streptococcus pneumoniae 

15 

<400> 65 

atggcaaaac ttactgttaa agacgttgac ttgaaaggta aaaaagtcct cgttcgtgtt 60 
gacttcaacg taccattgaa agatggcgta atcactaacg ataaccgtat cacagcagct 120 
cttccaacta ttaagtacat catcgaacaa ggtggacgtg caattctttt ctctcacctt 180 

20 ggacgtgtga aagaagaagc tgataaagct ggtaaatcac ttgctcctgt agcagcagac 240 
ttggcagcaa aacttggtca agatgttgtt* ttcccaggtg tcactcgtgg tgctgaattg 300 
gaagcggcaa tcaacgctct tgaagatgga caagttctct tggttgaaaa cactcgttac 360 
gaagatgttg acggcaagaa agaatctaaa aacgatcctg aacttggtaa atactgggca 420 
tcacttggag atggt'atctt cgtaaacgat gcattcggta cagctcaccg tgcacacgca 480 

25 tctaacgttg gtatctcagc aaacgttgaa aaagcagttg ctggtttcct tcttgaaaac 540 
gaaattgcct acatccaaga agcagttgaa actccagaac gtccattcgt ggctatcctt 600 
ggtggttcaa aagtttcaga caagatcggt gttatcgaaa acttgcttga aaaagctgat 660 
aaagtcctta tcggtggtgg gatgacttac acattctaca aagcacaagg tatcgaaatc 720 
ggtaactcac ttgtagaaga agacaaattg gatgttgcga. aagctcttct tgaaaaagca 780 

30 aatggtaaat tgatcttgcc agttgactca aaagaagcta acgcatttgc tggttacact 840 
gaagtgcgtg acactgaagg tgaagcagtt tctgaaggct tccttggtct tgacatcggt 900 
ccaaaatcta tcgccaaatt tgacgaagct ttgactggtg ccaaaacagt tgtatggaac 960 
ggacctatgg gtgtatttga aaacccagat ttccaagctg gtacaatcgg tgtgatggac 1020 
gctatcgtga aacaaccagg agttaaatca atcatcggtg gtggtgactc agctgccgca 1080 

35 gcgattaacc ttggccgtgc agacaagttc tcatggatta gtacgggtgg tggagcatca 114 0 
atggaacttc ttgaaggtaa ggttcttcca caacttgcag ccttgacaga aaaataa 1197 

<210> 66 
<211> 498 
40 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 66 

atgttaaaat cagaaaaaca atcacgttat caaatgttaa atgaagaatt gtccttccta 60 
45 ttggaaggcg aaaccaatgt tttggctaat ctttccaacg ccagtgctct cataaaatca 120 
. cgttttccta ataccgtatt tgcaggcttt tatttgttcg atggaaagga attggtttta 180 
ggccccttcc aaggaggtgt ttcctgcatc cgtattgcac taggcaaggg tgtttgtggt 240 
gaggcagctc actttcagga aactgttatt gttggagatg tgacgaccta tctcaactat 300 
atttcttgtg atagtctagc taaaagtgaa attgtggtgc cgatgatgaa gaatggtcag 360 
50 ttacttggag ttctggatct ggattcttca gagattgagg attacgatgc tatggatcga 420 
gattatttgg aacaatttgt cgctattttg cttgaaaaga cagcatggga ctttacgatg 480 
tttgaggaaa aatcttaa 4 98 

' <210> 67 
55 <211> 630 
<212> DNA 

<213> Streptococcus pneumoniae 
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<400> 67 

atgacaatcg aactattgac tccctttacc 
aaaaaacgca'aacaagttgg gattttaggg 
5 ctcattgttg cggatcaagt acggcaacag 
gaataccaac ctcctcacgt tgataaaaag 
atgcttgagt tggcaattga gggaattgac 
cgcaagggta tttcctacac ctacgacacc 
acggattatt actttatcat cggtgccgac 
10 attgatgaac tggttgacat ggttcagttt 
gggacttcct atccagttat -ctgggtggac 
gtgcgtgcct tccttgccca acjgtcggaaa 
gactacatcg agaaggaggg gctctactga 

15 <210> 68 
<211> 768 
<212> DNA 

<213> Streptococcus pneumoniae 

20 <400> 68 

atgaatattg caaaaatagt cagagaagcg 
tttgcgacag gcatttttga tgaatttatc 
gatggtgcag ttgttggtgg tattggttgg 
atccaaaaag gcaagagttt gcaagacaac 

25 gaaggctacc gaaaggcact gcggttgatg 
gtgaccttta tcaatacagc aggtgcttat 
ggggaagcta ttgctcgcaa tctcatggaa 
attattatcg gtgaaggtgg ttcaggcggg 
tggatgctgg aaaattctat ctatgccatt 

30 tggaaggacg gtactcgcgc catggaagca 
ctgttagaaa tggacgtggt ggataaggtg 
ctgattaaga gtgtcaaaaa agaactccaa 
ctagaagagt tgctggaaga acgctatcaa 

35 <210> 69 

<211> 510 

<212> DNA 

<213> Streptococcus pneumoniae 

40 <400> 69 

atgattataa aagtagaaat ggcagatgtt 
tttcgtgaaa cctttgcgta tgataatacg 
gcttatagtc tgaaaacttt gtcaactgag 
attatgcatg aggaggagat agctggtttt 

45 gagagagaat tagaggacgc ttttgaaatt 
ggatttggac taggtaagca actgtttgaa 
ttttcttggg cttggctagg tgtttgggag 
cgatatggtt ttgaaaaatt tagccaacat 
gattggttac tgagaaagaa attaaggtaa 

50 

<210> 70 
<211> 1590 
<212> DNA 

<213> Streptococcus pneumoniae 

55 

<400> 70 

atgttacggg ggactgcttt gctaacggct 



PCT/US00/35604 



aaggtagagt tggagccaga aatcaaggag 60 
gggaatttta accctgttca caatgcccat 120 
ttgggactgg atcaagttct gctcatgcct 180 
gaaaccatcc ctgaacacca tcgtctcaag 240 
ggcctagtca ttgaaaccat tgagttggag 300 
atgaagattt tgacagagaa gaatccagat 360 
atggttgact atctgcctaa gtggtaccga 420 
gtgggggttc agcgtccacg ctacaaggta 480 
gtaccgctca tggatatctc gtccagcatg 540 
cccaactttc tcctacctca gccagtgcta 600 

630 



cgtgagcaga gtcgcttgac aaccttggac 60 
caattacatg gtgaccgttc ttttcgtgat 120 
cttggagacc aagctgtaac agtggttggt 180 
ctcaaacgga attttggcca accacatcca 240 
aaacaggctg agaaatttgg ccgtccagtt 300 
cctggtgtcg gagcggaaga acgtggtcaa 360 
atgagtgacc tgaaagttcc tattatcgcc 420 
gctctggctc tagctgtcgc ggaccgtgtc 480 
ctcagtccag aaggctttgc ttccatttta 540 
gcagaactga tgaaaatcac ttcgcatgaa 600 
atttctgaag taggactttc tagtaaagaa 660 
acggagctgg ctagactttc acaaaaaccg 720 
cgatttagaa aatactaa 768 



gaggt'gttgg ctaaaattgc caaacaaacc 60 
gaagagcagt tacaggaata ctttgaagag 120 
ttgggaaatc ctgactctga aacctatttc 180 
ctcaaagtca actggggaag tgctcaaact 240 
caacgcctct atgtgctaca aaaattccaa 300 
ttcgcacttg aacttgctac aaaaaatagt 360 
cataatacaa aagctcaagc cttttataat 420 
cattttatgg ttggtcaaaa agtagatacg 480 

510 



agtaacttta. tcagtcgcct actcggggct 60 
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gtttacatta tcccttggta catctggatg ggggcttatg cagctaaggc aaatggtctc 120 
tttaccatgg gttacactat ctatgcttgg ttcttgttgg tttcaacagc ggggattcca 180 
gttgcggtgg ccaagcaagt tgccaagtat aataccatgc gagaagaaga gcatagcttt 240 
gccctgattc ggagcttctt aggctttatg acaggactag gcctggtttt tgctttagtc 300 
5 ttgtatgtct ttgctccttg gctagcagac ttgtctggcg tgggcaaaga cttgatccca 360 
atcatgcaaa gcttggcttg gggagtcttg attttcccgt ctatgagtgt tatccgagga 420 
tttttccaag ggatgaataa cctcaaaccc tatgccatga gccaaattgc tgagcaggtc 480 
attcgtgtta tctggatgct cctagcaacc tttatcatta tgaagctcgg ttcaggagat 540 
tatctagcag ccgttaccca atcaaccttt gctgcctttg .tcggtatggt agccagtttt 600 

10 gcagtcttga tttatttcct tgcccaagaa agttcactca aaagagtctt tgaaacagga 660 
gataagatta acagtaagcg tctcttggtt gataccatta aggaagccat tccttttatc 720 
ctgacagggt ctgccatcca gatcttccag attttggatc agctgacctt tatcaatagt 780 
atgagctggt ttaccaacta cagcaatgag gacttggttg tcatgttttc tta.tttctca 840 
gccaatccta ataaaatcac gatgattttg atttctgtag gggtttcgat tgggagtgtt 900 

15 ggtttgccac ttttgacgga aaactatgtc aagggggact tgaaagcggc ttctcgtctc 960 
gttcaggaca gtctcaccct actctttatg ttcttgctac cagcaacggt tggagtggtt 1020 
atggtaggag aacctcttta tacggtcttc tatggtaagc cagatagttt ggctctgggc 1080 
ttatttgtct ttgcagtttt gcagtctatt attttaggct tgtacatggt cttgtctcca 1140 
atgcttcagg ccatgttccg caaccgcaag gccgttctct attttatcta tggttctatt 1200 

20 gccaagctag tcttgcaact acctaccatc gccctcttcc acagttatgg tcctttgatt 1260 
tcaacaacca ttgctctcat cattcctaac gtcttgatgt atcgggatat ttgtaaagta 1320 
actggtgtca agcgcaaggt gattttgaag cgaaccattt taatcagttt gctgacccta 1380 
gtcatgtttc tgttaatagg aaccatccag tggctgttag gatttttctt ccaaccaagt 1440 
ggacgtttgt ggagcttctt ttatgtagct cttgtcggtg ccatgggggg tggactttat 1500 

25 atggttatga gtctgcgtac ctatttatta gataaggtaa taggaaaagc ccaagcagat 1560 
•cgcctgcgag caaaatttaa gctttcgtaa 1590 

<210> 71 

<211> 468 

30 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 71 

atgtcagata agattggctt attcacaggc tcatttgatc c'gatgacaaa tgggcatctg 60 

35 gatatcattg aacgggcgag cagactcttt gataagctct atgtcggtat tttttttaat 120 

ccccacaaac aaggatttct tcctatcgaa aatcgtaaac gggggctaga aaaggctttg 180 

ggacatctgg aaaatgttga agtcgtggct tctcatgatg aattggtggt cgatgttgca 240 

aaaagattgg gtgctacttg tctagtgcgt ggtttgagga atgcgtcgga tttgcaatat 300 

gaagccagtt ttgattacta caatcatcag ctgtcttctg atatagagac tatttattta 360 

40 catagtcgac ctgaacatct ctatatcagt tcatcaggcg ttagagagct tttgaagttt 420 

ggtcaggata ttgcctgcta tgttcccgag agtatttgga ggaaataa 4 68 

<210> 72 
<211> 432 
45- <212> DNA 

<213> Streptococcus pneumoniae 

<400> 72 

atgacgattt tgtttgtggt tatcagtgct tcctttctgt atatggtttc tcttagcatg 60 

50 aaaccctatc aaacagctaa aagtgaagga gaaaaattag ctcagcagta tgcaggatta 120 

gagcaggccg atcaggttga tttatacaat ggcttggaat cttattacag cgttcttggt 180 

cgtaataaac agcaagaagc - acttgctgtt ctgattggaa aagatgatca taagatttac 240 

gtttatcagc taaatcaggg tgtttcacaa gaaaaagcag aaacggtttc taaggaaaag 300 

ggagctggcg aaattgacaa gattatcttt ggtcgttatc aagataagcc aatctgggaa 360 

55 gtcaagtcag gatctgattt ttatctagta gattttgaaa caggagcatt ggtcaacaag 420 

gagggcctat ga 432 
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<210> 73 
<211> 732 
<212> DNA 

<213> Streptococcus pneumoniae 
*<400> 73 

atgattgata ttcattcgca cattgtcttt gatgtagatg atggtcccaa 

gaaagtaagg ctctcttgac agaagcctac aggcaggggg tgcgaaccat 

tctcaccgtc gcaagggcat gtttgaaact ccagaagaga agatagcaga 

caggttcggg aaatagctaa ggaagtcgcg agtgacttgg tcattgctta 

atttactaca cgccagatgt tttggataag ctggaaaaca atcggattcc 

aatagtcgtt atgccttgat agagtttagt atgaacactc cttatcgcga 

gccttgaata aaatattgat gttgggaatt actcccgtca ttgcccacat 

gatgttcttg aaaataatga aaaacgcgtt cgagagctga tcgatatggg 

caaataaata gttcacatgt cctcaaatcc aaactttttg gagaacctta 

aaaaaaagag cgcagtattt cttggagcgt gatttggttc atatcattgc 

cataatgtgg acggcagacc cccccatatg gcagaagcat atgaccttgt 

tacggagaag cgaaggctca ggaacttttt atagacaatc ctcgaaaaat 
caactaattt ag 

<210> 74 
<211> 927 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 74 

atgtctacaa tcgataaaga aaaatttcag tttgtaaaac gtgacgattt 
actattgatg cgccagcata ttcttactgg aaatcagtgt ttaaacaatt 
aaatcaactg tagtcatgtt gggaatcttg gtagccatca ttttgataag 
ccaatgtttt ctaagtttga tttcaatgat gtcagcaagg taaacgactt 
tatatcaagc caaatgcgga gcattggttc ggtactgaca gtaacggtaa 
gacggtgtct ggttcggagc tcgtaactcc atcctcattt ctgtgattgc 
aacttggtta tcggtgtttt tgtcggtggt atttggggta tttcaaaatc 
gtcatgatgg aagtttacaa cgtcatctca aacatcccac ctcttttgat 
ttgacttact caatcggagc tggattctgg aatctgattt ttgccatgag 
tggattggta ttgccttcat gatccgtgtg caaatcttgc gctatcgtga 
aacttggcgt cacgtacttt gggaacacca accttgaaga ttgttgccaa 
cctcaattgg tatctgttat tgtgacaacc atgactcaaa tgcttccaag 
tacgaagcct tcttgtcttt "cttcggtctt ggattaccga ttacagtgcc 
cgtttgattt cggattattc acaaaacgta acaaccaatg cttacttgtt 
ttgacaaccc ttgtcttggt atccttgtcc cttttcgtag ttggtcaaaa 
gctagtgatc cacgtacaca tagatag 



gtcaagagag 60 
tgtctctacc 120 
aaactttctt 180 
tggggctgaa 240 
gaccctcaat 300 
tattcatagt 360 
agagcgctat 420 
ctgttacacg 480 
taaattcatg 540 
aagtgatatg 600 
ttcccaaaaa 660 
tgtaatggat 720 
732 



tgcctctgaa 60 
tatgaagaaa 120 
tttcatctac 180 
tagtgttcgt 240 
atcgctcttt 3.00 
gacagtgatt 360 
agttgaccgt 420 
tgttattgtc 480 
cgtaacaaca 540 
cttggaatac 600 
aaatatcatg .660 
ctttatctca 720 
aagtttgggt 780 
ctggattcca 840 
cttagcggat 900 
927 



<210> 75 
45 <211> 234 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 75 

50 atgtataacc tattattaac cattttatta gtattatctg ttgtgattgt gattgcaatt 60 

ttcatgcaac caaccaaaaa ccaatccagc aatgtatttg atgccagttc aggtgatttg 120 

tttgaacgca gtaaagctcg cggttttgaa gctgtaatgc agcgtttgac agggatttta 180 

gtctttttct ggctagccat tgccttagca ttgacggtat tatcaagtag ataa 234 

55 <210> 76 

<211> 1110 
<212> DNA 
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<213> Streptococcus pneumoniae 
<400> 76 

atgtttcgta gaaataaatt atttttttgg accacagaaa ttttactctt aaccatcatc 60 
5 ttttacctat ggagacagat gggatctttg attaaccctt ttgttagcgt gcttaataca 120 
attatgattc catttttatt agggggcttt ctttattatt tgacaaaccc tattgttact 180 
ttcttaaata aagtctgtaa actcaatcgt ttgcttggta ttttaattac cttgtgtact 240 
ttggtctggg gaatggtcat aggtgttgtc tatctcttac ctattttgat taatcagtta 300 
tctagtttga ttatatctag tcaaactatt tatagtcgag tacaagactt aatcatagac 360 

10 ttatctaatt atcctgcgct ccagaatttg gatgtagaag ctacaattca gcagttaaac 420 
ttatcctatg ttgatattct tcaaaatatc ctaaatagcg tatcaaatag tgtggggagc 480 
gtcttgtcag ctcttatcag tactgttttg attttgatta tgactccagt ttttttggtt 540 
tatttcttat tagatggaca taaattcttg cccatgcttg aaagaacgat tctaaagagg 600 
gatcgcttgc atattgcagg cttattaaag aatttaaatg cgacgattgc tcgctatatt 660 

15 agtggagttt cgattgacgc aatcattata ggttgtttgg cttatattgg ctatagtatt 720 
attggtttaa aatatgcttt agtttttgcc attttttctg gtgtagccaa tttaattcct 780 
tatgtggggc caagtattgg tttgattcct atgatcatcg caaatatatt cactgtaccc 840 
catagactgc tgattgcagt gatttatatg cttgttgttc agcaggtaga tggcaatatc 900 
ttatatcctc gaattgtagg aagtgttatg aaggttcatc caatcacgat tttagtttta 960 

20 cttttgttgt caagcaatat ctatggtgta gttggaatga ttgtcgcagt gccaacctat 1020 
tctatcttga aagaaatttc taagttctta tcccgtttgt atgaaaatca taaaataatg 1080 
aaagaacgag aaagagaatt agctaagtaa 1110 

<210> 77 
25 <211> 1356 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 77 

atgtatcaag cactttatcg aaaatataga agtcaaaact tctcccagtt agttggtcaa 60 
gaagttgtgg ctaagactct taaacaagcg gtggagcaag agaaaataag tcacgcttat 120 
cttttttctg gtcctcgtgg aacgggaaaa accagtgttg ctaaaatctt tgccaaggct 180 
atgaactgtc ccaatcaagt gggtggcgaa ccttgcaata actgctatat ttgtcaagca 240 
gtgacggacg gtagtttaga agatgtcatt gaaatggatg cagcttctaa taatggggta 300 
gatgaaattc gcgaaattcg tgataaatct acctatgcgc ctagccttgc tcgttataag 360 
gtttatatca tagatgaggt tcacatgctg tctacagggg cttttaatgc cctcctaaag 420 
acgctggaag aaccaacaca gaatgtagtc tttattttgg ccactactga attgcacaag 480 
attcctgcta ctattctatc ccgtgtgcaa cgttttgagt ttaaatcaat taagacacag 540 
gatattaagg aacatattca ctatatctta gaaaaagaaa atatcagttc tgaaccagag 600 
gctgtggaaa tcattgccag acgggcggaa ggtggaatgc gggacgcctt gtctattttg 660 
gatcaagccc tgagtttgac acagggaaat gagctgacga ctgctatctc tgaagaaatt 720 
actggcacca ttagcctatc agccttggat gattatgtgg cggccttgtc tcaacaggat 780 
gttcccaaag. ctttgtcttg cttgaatctt ctttttgaca atggtaagag catgactcgt 840 
tttgtgaccg atcttttgca ctatttaaga gacttgttaa ttgttcaaac agggggagaa 900 
aatactcatc atagttcagt ctttgtagaa aatttggcac ttcctcaaaa aaatctgttt 960 
gaaatgattc gcttagcaac agtgaattta gcagatatta agtctagttt gcagcccaag 1020 
atttatgctg aaatgatgac cgtccgtttg gcggaaatca agcccgaacc agctctatca 1080 
ggagcggttg aaaatgaaat tgctacgctg agacaggaag ttgcccgtct caaacaagag 114 0 
ctttctaatg caggtgcggt tcctaaacaa gttgcaccag ctcctagtcg accagctacg 1200 
ggcaaaacag tctatcgtgt cgatcgcaat aaagtgcaat ctatcttaca agaggccgtc 1260 
gaaaatcctg atttaacacg tcaaaatcta attcgtttgc agaatgcatg gggagaggta 1320 
attgaaagtc taggtgggcc ggacaagctc tgctag 1356 

<210> 78 
55 <211> 1989 
<212> DNA 

<213> Streptococcus pneumoniae 
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<400> 78 

atgtttcgat taaccaataa gttagcggta 
tatccttttg cgctggctgt tctcttggca 
5 accttcaatc ctaagattgc ggaaatccgt 
tttggta,tgt ttgtcgtcac ccttgcgtca 
gtcatgaaga aacgttccaa ggaactagga 
catcttatca gtatgacctt taaggagtta 
ggtatcggta ttggagcctt gtttgacaag 

10 aaattgaagg ttgagctggt tgctaccttc 
gtcttcggtt tgattttcct aggcctcatg 
aatgccctcc agctctctcg tgagaaagct 
ctccaaacca ttcttggttc cataagttta 
aaagatcctc ttacagcctt aacaaccttc 

15 acttatctct tgtttaatgc agggattacc 
aaatactatt accaaccaaa taacctcata 
aaaaatgcag ttggactagc aactatcgct 
tcagcagcga caagcatttt caattccgca 
gattttgggg tttcaggaca aaatgttgaa 

20 tttgcaagtg acaatggtta taagattaaa 
ggtgttgcga accaagaagg aaataagtta 
caacccacaa cagttttcat ggtatttgac 
aaactgtctc tatcaggaaa tgaggtcggt 
cagaaaactc taattctgaa tgatcatcaa 

25 tttattgtca accatgtccc aaatcagttt 
gttgtacctg atttacaagc ctttttgaac 
ttttacggtg gtatgaatgt aaatgtcagt 
tatgaaaact acctcaatca atttaatgct 
ggtagcaatc tagcagatgc tagttctcag 

30 atcggtattt tcctatccat tatctttatg 
caaatttctg aaggctacga agaccgtgaa 
gaccaaaagc aaatcaagca aaccatccac 
ttgctctttg ccttcataca tctcgccttt 
gtgattggtg tactggatac gactatgatg 

35 ttcctcatcg cctatgtgct gattttcatg 
caaatgtaa 

<210> 79 
<211> 891 
40 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 79 

atgaaacaag atcaactaaa ggcttggcaa 
45 ttagaacaag accagctcaa tcacgcctat 
atggcgcaat ttttagctaa gagcctcttt 
gagaaatgcc gaagttgcaa gctgattgaa 
aagccagtca atcaggtcat caagacagaa 
caagcaggga ttgaaagcca gcaacaggtc 
50 cccaacgcag ccaattctct gctcaaggtc 
ttcttcttga ctagcgatga ggaaaagatg 
ttccacttta aaaagcaaga agaaaaactt 
aagaaaaaag cgactctttt agctaagttt 
gctaatcagg caagtttttg gaccttggtc 
55 gtagctaaga aaaaagaaag ttatctacag 
aaggaaaaac aggatcaggt tttacggatt 
caggtaagag taagagtgat tctacaagat 



tcgaacttga ttaaaaaccg caaactctac 60 
gtcactctca cctatctctt ttactctcta 120 
ggaggaacaa ccattcaggc tacacttgga 180 
gccattatcg ttctctatgc caatagtttt 240 
atttatggca tgttgggctt ggagaagcgt 300 
gtggtatttg ggattctaac tgttggagcg 360 
ttaattttcg ctttcctgct caaactaatg 4'20 
cagacgaaag ttgtcattac agtgcttgtt 480 
ttcctgaatg cccttcgaat cgcccgtatg 54 0 
agtggagaga aaaaaggtcg cttccttcct 600 
ggaattggct attatcttgc ccttacggta 660 
ttcatagctg ttttactggt tatctttggg 720 
gttttcctcc aaatcttaaa gaaaaataag 780 
tctgtttcta acttgatttt ccgtatgaag 840 
attttgtcaa caatggtttt ggtaaccatg 900 
gaatccttta aaaaagttct aaatcctcat 960 
aaagaagatt tggacaaact cttgagccag 1020 
gaaaaagaag tgtttcgtta cacttacttt 1080 
accttttttg aaaaaggaca aaatcgtgtc 1140 
caaaaagatt atgaaaatat gactggtcaa 1200 
ctctttgcca aaaatgacgg actgaaagga 1260 
ttttctgtaa aagaagaatt taataaagat 1320 
aatattttga ctgctgatta caattacctt 1380 
caattcccag attcggatat ctataatcag 14 40 
gaagaagaac aactcaaggt cgctgaggag 1500 
caattagaca cagaaggtag ctatgtttat 1560 
atgagtgccc tctttggtgg tgtcttcttt 1620 
gtcggaactg ttctggtcat ctactacaaa 1680 
cgctttatta tcttgcagaa agtcggtttg 1740 
aaacaggttt taactgtttt cttccttcct 1800 
gcctaccata tgcttagcct gattttaaaa 18 60 
ttgattgtga ccttgtctat ctgcgctatc 1920 
attacttcaa gaagttatcg caagattgtg 1980 

1989 



ccagctcagt ttgaccgttt tgtccgtatc 60 
ctcttttcag gtttctttgg aagcttggaa 120 
tgtacggata aagttggcgt cttaccatgt 180 
caggaagagt ttccagatgt caccttgatt 240 
cgcattcggg aattggtggg acagttttct 300 
tttattatcg agcaagcgga taaaatgcat 360 
atcgaagaac cccagagtga agtttatatt 420 
ttaccgacaa tccgaagtcg gactcagatc 480 
atcttactct tagaacaaat gggacttgtt 540 
agtcaatcgc gagctgaagc agaaaagttg 600 
gatgaaagtg aacgcctgct gacttggtta 660 
gttgccaaat tagccaactt ggcagatgat 720 
cttgaagttc tctgtgggca ggacctcttg 780 
ttactagaag ctagaaaaat gtggcaagct 840 
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aatgtcagct ttcaaaatgc catggaatat ctggtcttga aagaaatata a 891 

<210> 80 
<211> 615 
5 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 80 

atgaattcat ttaaaaattt cttaaaagag tggggactgt tcctcctaat tctgtcatta 60 

10 ctagctttaa gtcgtatctt tttttggagc aatgttcgcg tagaaggaca ttccatggat 120 

ccgaccctag cggatggcga aattctcttc gttgtaaaac accttcctat tgaccgtttt 180 

gatatcgtgg tggcccatga ggaagatggc aataaggaca tcgtcaagcg cgtcjattgga 240 

atgcctggcg acaccattcg ttacgaaaat gataaactct acatcaatga caaagaaacg 300 

gacgagcctt atctagcaga ctatatcaaa cgcttcaagg atgacaaact ccaaagcact 360 

15 tactcaggca agggctttga aggaaataaa ggaactttct ttagaagtat cgctcaaaaa 420 

gcccaagcct tcacagttga tgtcaactac aacaccaact ttagctttac tgttccagaa 480 

ggagaatacc ttctcctcgg agatgaccgc ttggtttcga gcgacagccg ccacgtaggt 540 

accttcaaag caaaagatat cacaggggaa gctaaattcc gcttctggcc aatcacccgt 600 
atcggaacat tttaa * 615 



20. 



25 



<210> 81 
<211> 987 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 81 

atggtagtat ttacaggttc aactgttgaa gaagcaatcc agaaaggatt gaaagaatta 60 
gatattccaa gaatgaaggc tcatatcaaa gtcatttcta gggagaaaaa aggctttctt 120 
ggtctatttg gtaaaaaacc agcccaagtg gatattgaag cgattagtga aacgactgtt 180 

30 gtcaaagcaa atcaacaggt agtaaaaggc gttccgaaaa aaatcaatga tttgaacgag 240 
cctgtgaaga cggttagtga agaaaccgtt gaccttggtc atgtggttaa tgctattaaa 300 
aaaatagagg aagaaggtca aggtatttct gatgaagtca aggctgaaat cttaaaacat 360 
gaaagacatg ccagcactat cttagaagaa actggtcaca ttgagatttt aaatgaactt 420 
caaatcgagg aagcgatgag ggaagaagca ggcgctgatg accttgaaac tgagcaagat 480 

35 caaactgaaa atcaagactt gaaagagatg ggcttgaagg tcgagcaaag ttatgatatt 54 0 
gcccaggtgg ctacggatgt gactgcctat gttcaagcga ttgtggatga catggatgtt 600 
gaagctacac tttcaaatga ttataaccgt cgtagcatca atctacaaat tgacaccaac 660 
gaaccaggtc gtattatcgg ctaccatggt aaagtcttga aggccttgca actgttggct 720 
caaaattatc tttacaaccg ctattccaaa accttctacg ttacaatcaa tgtcaatgat 7'80 

40 tatgtcgaac accgtgcaga agtcttgcag acctatgcgc aaaaattggc gaatcgtgtt 84 0 
ttggaagaag gtcgcagtca taaaacagat ccaatgtcaa atagcgaacg caagattatc 900 
catcgtatta tttcacgtat ggatggcgtg actagttact ctgaaggtga tgagccaaat 960 
cgctatgttg ttgtagatac agaataa 987 

45 <210> 82 

<211> 1383 
<212> DNA 

<213> Streptococcus pneumoniae 
50 <400> 82 

atgtcaaatt ttgccattat tttagcagcg ggtaaaggga ctcgcatgaa atctgatttg 60 
ccaaaagttt tgcacaaggt tgcgggtatt tctatgttgg aacatgtttt ccgtagtgtg 120 
ggagctatcc aacctgaaaa gacagtaaca gttgtaggac acaaggcaga attggttgag 180 
gaggtcttgg ctggacagac a.gaatttgtg actcaatctg aacagttggg aactggtcat 240 
55 gcagttatga tgacagaacc tatcttagaa ggtgtgtcag gacacacctt ggtcattgca' 300 
ggagatactc ctttaatcac tggtgaaagc ttgaaaaact tgattgattt ccatatcaat 360 
cataaaaatg tggccactat cttgactgct gaaacggata atccttttgg ctatggacga 420 
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attgttcgta atgacaatgc tgaggttctt cggtcattgt tgagcagaag gatgctacag 480 
attttgaaaa gcaaatcaag gaaatcaaca ctggtaacat acgtctttga caacgagcgt 540 
ttgtttgagg ctttgaaaaa tatcaatacc aataacgctc aaggcgaata ctatattaca 600 
gacgtcattg gtattttccg tgaaactggt gaaaaagttg gcgcttatac tttgaaagat 660 
5 tttgatgaaa gtcttggggt aaatgaccgt gtggcgcttg cgacagctga gtcagttatg 720 
cgtcgtcgca tcaatcataa acacatggtc aacggtgtta gctttgtcaa tccagaagca 780 
acttatatcg atattgatgt tgagattgct ccggaagttc aaatcgaagc caatgttatc 840 
ttgaaagggc aaacgaaaat tggtgctgag actgttttga caaacggtac ttatgta.gtg 900 
gacagcacta tcggagcagg agcggtcatt accaattcta tgattgagga aagtagtgtt 960 

10 gcagacggtg tgacagtcgg tccttatgct cacattcgtc caaattcaag tctgggtgcc 1020 
caagttcata ttggtaactt tgttgaggtg aaaggatctt caatcggtga gaataccaag 1080 
gctggtcatt tgacttatat cggaaactgt gaagtgggaa gcaacgttaa tttcggtgct 1140 
ggaactatta cagtcaacta tgacggcaaa aacaaataca agacagtcat tggagtcaat 1200 
gtctttgttg gttcaaattc aaccattatt gcaccagtag aacttggtga caattccctc 1260 

15 gttggtgctg gttcaactat tactaaagac gtgccagcag atgctattgc tattggtcgc 1320 
ggtcgtcaga tcaataaaga cgaatatgca acacgtcttc ctcatcatcc taagaaccag 1380 
tag 1383 

<210> 83 
20 <211> 936 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 83 

25 atgtccaaga ttctagtatt tggtcaccaa aatccagact cagatgccat cggatcatct 60 

gtagcttttg cctaccttgc aaaagaagct tacggtttgg atacggaagc tgttgccctt 120 

ggaactccaa atgaagaaac agcctttgtc ttgaactatt ttggtgtgga agcaccaqgt 180 

gttatcactt ctgccaaagc agagggggca gagcaagtta tcttgactga ccacaatgaa 240 

ttccaacaat ctgtatcaga tatcgctgaa gtagaagttt acggtgttgt agaccaccac 300 

30 cgtgtggcta actttgaaac tgcaagccca ctttacatgc gtttggagcc agttggatca 360 

gcgtcttcaa tcgtttaccg tatgttcaaa gaacatggtg tagctgttcc taaagagatt 420 

gcaggtttga tgctttcagg tttgatttca gatacccttc ttttgaaatc accaacaaca 480 

cacccaacag ataaaatcat tgctcctgaa ttggctgaat tggctggtgt aaacttggaa 540 

gaatatggtt tggcaatgtt gaaagctggt accaacttgg ctagcaaatc tgctgaagaa 600 

35 ttgattgaca tcgatgctaa gacttttgaa ctcaacggaa ataatgtccg tgttgcccaa 660 

gtgaacacag ttgacatcgc tgaagttttg gaacgccaag cagaaattga agctgcaatg 720 

caagctgcca acgaatcaaa cggctactct gactttgtct tgatgattac agatatcgtc 780- 

aactcaaact cagaaatatt ggctcttggt gccaatatgg acaaggtcga agcggctttc 840 

aatttcaaac ttgaaaacaa tcatgccttc cttgctggtg ccgtttcacg taagaaacaa 900 

40 • gtggtacctc aattaactga aagctttaat acgtaa 936 

<210> 84 
<211> 678 
<212> DNA 
45 <213> Streptococcus pneumoniae 



<400> 84 

atgatttcaa agagattaga attggtagct 
gatgtgggaa gtgaccatgc ttatctgcct 

50 agcgctattg caggtgaggt ggtggaaggt 
gctcacggcc taaaggagaa aatccaagtc 
gagactgacc aagtgtctgt cattaccatt 
attttagaag aaggtttggg gaagttagct 
aatcgtgaag acgacttgcg tatctggcta 

55 agcatcttag aagaagctgg aaagttttat 
aagctatcag ccagtgatgt tcgctttggt 
tttgtccaaa aatggcaaaa agaagctgag 



tcctttgtgt cacagggggc tattttacta 60 
atcgagttgg ttgagagagg ccaaatcaaa 120 
ccctatcagt ctgcggttaa aaatgttgag 180 
cgtttagcca atggcttggc agcttttgaa 240 
gctggcatgg gtggtcgttt gattgctagg 300 
aatgtagagc gtttgatcct ccagcccaat 360 
caggatcatg gattccagat tgtagcagaa 420 
gagattttgg tggtggaagc aggacaaatg 480 
cccttcttgt ccaaagaagt cagtccagta 540 
aagctagagt tcgccctcgg acaaatccca 600 
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gaaaaaaatc tggaagaacg tcaagttcta gtagataaga ttcaagctat caaggaggtg 660 
ctccatgtta gcaagtga 678 

<210> 85 
5 <211> 486 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 85 

10 atgaatttaa acgatattaa agacttgatg actcaatttg accagtcaag tttgagagaa 60 
ttttcttata aaaatgggac ggatgagttg cagtttagca agaatgaagc gagacctgtg 120 
cctgaagttg caactcaagt cgctccagca cccgttctag caacaccgag tccagtagct 180 
cctacatctg ctccagcaga gactgtagca gaagaagttc cagctccagc tgaagcaagt 240 
gtggctagtg agggaaatct tgtagagagt ccacttgttg gagtggttta cttggctgct 300 

15 ggaccagata aacctgcctt cgttacagtt ggtgatagtg tcaaaaaagg tcaaacattg 360 
gtaattatcg aagccatgaa agtcatgaat gaaatcccag ctcctaagga tggtgtggta 4 20 
acggaaattc tcgtctctaa cgaagaaatg gttgagtttg gtaaaggatt ggtacgtatc 4 80 
aaatga 486 

20 <210> 86* 
<211> 1236 
<212> DNA 

<213> Streptococcus pneumoniae 
25 <400> 86 

atgaaactaa atcgagtagt ggtaacaggt tatggagtaa catctccaat cggaaataca 60 
ccagaagaat tttggaatag tttagcaact gggaaaatcg gcattggtgg cattacaaaa 120 
tttgatcata gtgactttga tgtgcataat gcggcagaaa tccaagattt tccgttcgat 180 
aaatactttg taaaaaaaga taccaaccgt tttgataact attctttata tgccttgtat 240 

30 gcagcccaag aggctgtaaa tcatgccaat cttgatgtag aggctcttaa tagggatcgt 300 
tttggtgtta tcgttgcatc tggtattggt ggaatcaagg aaattgaaga tcaggtactt 360 
cgccttcatg aaaaaggacc caaacgtgtc aaaccaatga ctcttccaaa agctttacca 420 
aatatggctt ctgggaatgt agccatgcgt tttggtgcaa acggtgtttg taaatctatc 480 
aatactgcct gctcttcatc aaatgatgcg attggggatg ccttccgctc* cattaagttt 540 

35 ggtttccaag atgtgatgtt ggtgggagga acagaagctt ctatcacacc ttttgccatc 600 
gctggtttcc aagccttaac agctctctct actacagagg atccaactcg tgcttcgatc 660 
ccatttgata- aggatcgcaa tgggtttgtt atgggtgaag gttcagggat gttggttcta 720 
gaaagtcttg aacacgctga aaaacgtgga gctactatcc tggctgaagt ggttggttac 780 
ggaaatactt gtgatgccta ccacatgact tctccacatc cagaaggtca gggagctatc 840 

40 -aaggccatca aactagcctt ggaagaagct gagatttctc cagagcaagt agcctatgtc 900 
aatgctcacg gaacgtcaac tcctgccaat gaaaaaggag aaagtggtgc tatcgtagct 960 
gttcttggta aggaagtacc tgtatcatca accaagtctt ttacaggaca tttgctgggg 1020 
gctgcgggtg cagtagaagc tatcgtcacc atcgaagcta tgcgtcataa ctttgtacca 1080 
atgacagctg ggacaagtga agtatcagat tatatcgaag ctaatgtcgt ttatggacaa 1140 

45 ggcttggaga aagaaattcc atacgctatt tcaaatactt ttggttttgg aggccacaat 1200 
gcagttcttg ctttcaaacg ttgggagaat agataa 1236 

<210> 87 
<211> 1080 
50 <212> DNA 

<213> Streptococcus pneumoniae 



<400> 87 

atgaacatct atgatcaact acaagttgta 
55 ctgagtgacc ctgatgtcgt ttcagacacc 
gcttccaatc gtgacaccgt aatagcctac 
gtcgatgccg aagagatgat taaggaatca 



gaagaccgtt atgaagaatt aggagaattg 60 

aagcgtttta tggagctttc aaaagaagaa 120 

cgtgagtata aacaagtcct tcaaaatatc .180 

ggcggagatg cggacttgga agaattggcc 240 
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aagcaagaac tcaaagatgc caaggctgaa 
ttgctccttc caaaggatcc aaacgatgac 
gctggtggag acgaagcggc acttttcgct 
gcggaagccc aaggttggcg ctttgaagtc 
5 tttaaagaag tggttgctat ggtttcaggt 
tcaggtgccc accgtgtgca acgtgttcct 
tcgacagcga cagttcttgt tatgccagaa 
aaagaccttc gtgtcgacat ctatcacgcc 
gttgcgactg ccgttcgtat cgttcacttg 
10 gaacgtaccc agcagaagaa ccgcgagaag 
gaccactttg ctcagattgc tcaggatgaa 
actggtgacc gttcagaacg gatccgaact 
caccgtatcg gcttgaccct ccaaaaacta 
gttgtggatg ccttggtgct ttatgaccaa 

15 

<210> 88 
<211> 1680 
<212> DNA 

<213> Streptococcus pneumoniae 

20 

<400> 88 

atggcctaca ctcttaaacc tgaagaagtt 
atcgggaaaa acacttacgg aattgaatac 
attaaattcc cagaagatga cttgcttggt 

25 atcgtagaca atatcgaccg cgtcaaggct 
attggtggga ttccgttcct actcaagcaa 
gccttggctt tgatccgtgg gaaactcgaa 
tacgaaatca accacaacac cgagttgacc 
acgactcact ctattccaga gcctttgggg 

30 gtctgtacgg gtgactttaa gttcgacttt 
cgtatggctg cgcttggtga agaaggcgtg 
gaagtaccaa cctttaccaa ctctgaaaaa 
caaggtattg aaggacgtat catctttgca 
caggcaacag aagctgctgt taagactgga 

35 gaaaaggcca ttgtcaacgg aatcgatctt 
atcgagccaa atgaaatcaa agattatcct 
agtcagggtg agcctatggc agccctctct 
caattacaac caggtgatac cgttatcttc 
agcgtcaaca agctgattaa catcatttct 

40 gtgaacaata t-ccatacatc tggacacggt 
ttgattaagc caaaatactt catgcctgtc 
gctggactag cagtggatac tggtgttgag 
gatgtgcttg cccttactgc tgactcagct 
atctatgtcg atggaaatcg tatcggtgaa 

45 gatctatctg aagacggtgt cgttctggca 
attctgtctg gcccagatat cctcagccga 
ttgattcgcc aaagccagcg tatcctcttc 
gatgctagcg tgcaatctgt caatggtgcc 
gaaaataccg aacgtgaacc gatcatcatc 

50 

<210> 89 
<211> 1362 
<212>* DNA 

<213> Streptococcus pneumoniae 

55 

<400> 89 

atggcagaag tagaagagtt acgagtacaa 



aaagaagaat atgaagaaaa actgaaaatt 300 

aagaatatca tccttgaaat ccgtggagca 360 

ggagatttgc taactatgta ccaaaagtat 420 

atggaagcct ctatgaatgg tgtcggtggt 480 

cagtctgtat actctaagct taagtatgaa 540 

gtgacagaaa gccaaggccg tgttcatact 600 

gttgaagagg ttgaatacga cattgatcca 660 

tctggtgctg gtggacagaa cgtcaataag 720 

ccaaccaata tcaaggttga gatgcaggaa 780 

gctatgaaga ttatccgtgc acgcgtcgct 840 

caagacgctg agcgtaagtc gacaatcggt 900 

tataacttcc cacaaaaccg tgtcacagac 960 

gatacgattt tgtctggtaa attggacgaa 1020 

acacaaaaac tagaagaatt aaacaaataa 1080 



ggtgtttttg ccatcggtgg tctaggagaa. 60 

caagacgaga ttatcatcgt cgatgctggg 120 

atcgactatg tcattcctga ctactcttac 180 

gttttaatca cacacggaca cgaggaccac 240 

gcaaatgtcc ctatttatgc tggaccgctt 300 

gaacacggcc tcttgcgcaa cgccaaactt 360 

tttaaaaatc tcaaggcaac tttctttaga 420 

attgtcattc atactcctca agggaaaatc 480 

actccagttg gagaacctgc ggacttgcat 540 

ctctgtctcc tgtctgactc gacaaatgcg 600 

gtcgttggtc agtccattat gaagattatc 660 

tcctttgcct caaatatctt ccgtctccag 720 

cgcaagattg cggtctttgg tcgttctatg 780 

ggctacatca aagctcctaa gggaaccttt 840 

gcaggagaag ttcttatcct ctgtacaggt 900 

cgtatcgcca acggaaccca ccgtcaagta 960 

tcttctagtc ccatccctgg aaacactact 1020 

gaagctggtg tcgaagttat ccacggtaaa 1080 

ggtcagcaag agcaaaaact catgctctgc 1140 

cacggtgaat accgcatgca aaaagtccac 1200 

aaggacaata tctttatcat gagcaatggc 1260 

cgtatcgcag gtcatttcaa cgcccaagat 1320 

attggcgcag ctgtcctcaa agatcgtcgc 1380 

gtcgcaactg ttgacttcaa atcgcagatg 1440 

ggctttgtct acatgagaga gtctggagac 1500 

aatgccattc gtatcgcact gaaaaataag 1560 

attgtcaacg ctattcgccc cttcctctat 1.620 

ccgatgatcc tcacaccaga tgaagaataa 1680 



cctcaagata tcttagctga gcaatccgtt 60 
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ttaggggcta tctttattga tgagagtaaa cttgtttttg tgcgagaata cattgagtct 120 

cgggactttt ttaagtatgc ccatcgtttg attttccaag ccatggtcga tttatccgat 180 

cgtggtgatg ccatagatgc aacaacggtt cgtactatcc ttgataatca aggtgattta 240 

cagaatattg gtggcttgtc ttacttggtt gagattgtta attctgtgcc aacttctgct 300 

5 aatgcggagt attatgctaa gattgttgca gaaaaagcaa tgctacgtcg tttaattgcc 360 

aagttgacag agtctgtcaa ccaagcttac gaagcgtcac aaccagctga tgaaattatt 420 

gctcaggcag aaaaagggtt gattgatgtc agtgaaaatg caaatcgaag cgggtttaag 4 80 

aacattcgag atgtgttgaa tctcaacttt ggaaatctgg aagctcgctc gcaacaaacg 540 

accgatatta caggtattgc gacaggttat cgtgatttgg atcatatgac aacaggactt 600 

10 catgaggagg agttgattat cttagcagct cgtccagcag ttggtaagac agcatttgcc 660 

ttgaatatcg ctcagaacat tgggactaag ttggacaaaa cggttgctat tttttcactc 720 

gaaatgggtg cggaaagctt ggtagatcgt atgttagctg cagaaggctt ggtggagtca 780 

cattctatcc gtacagggca attgacagat gaggagtggc aaaaatatac tattgctcag 840 

ggtaatctag ctaacgccag tatctatatc gatgatacgc caggtattcg gattacagag .900 

15 attcgttctc gttctcgtaa attggctcaa gaaactggaa atcttggttt gattgtgata 960 

gactatttgc aacttatcac gggaactggt cgagaaaatc gtcaacaaga agtttctgaa 1020 

atttctcgtc agttgaaaat actagccaag gaattgaagg ttccagtaat cgctctgagt 1080 

cagctttctc gtggtgtaga acaacgtcag gacaagagac cggtcttgtc tgatattcgt 1140 

gaatctgggt ctattgagca ggacgctgat atcgtagctt ttctctatcg cgatgactac 1200 

20 tatgaacgtg gtggtgaaga agaggagggt atcccaaata ataaggtgga agttattatc 1260 

gagaaaaacc gtagtggagc tcgtggaaca gtggaattga ttgtccaaaa agaatacaat 1320 

aaattcttca agtattcaag tatctcaaag agggaggcat aa 1362 



<210> 90 
25 <211> 693 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 90 
30 atggcgtata 
tatcttggac 
agtaaaatgg 
aaaggcgatg 
gcaagtgacc 
35 ttggatctga 
gcagctttta 
cgtaccttag 
aaagggttaa 
aaagagatcc 
40 gctcgcaaga 
gatgcacaaa 



aatatttagt 
gaaactataa 
gtatcgattt 
tcatcaaaag 
cggatagaga 
aagaaaaaaa 
aggaaccaag 
acagaatcgt 
gtgcaggacg 
gagaatttgt 
aattcaaagc 
gtgtaaaaag 



gattgtagaa 
agtaatggcc 
tgaaaacaat 
cctgaaagcc 
aggagaagcg 
tcgggtggtc 
aacgatcgat 
tggttattcg 
tgtccaatct 
tccagaagaa 
aaatttctgg 
aagtcactgc 



tcacctgcaa 
agtgttggcc 
tatgaacccc 
gcagcaaaaa 
attgcttggc 
ttcaatgaaa 
gtagatttag 
atcagtccta 
gtcgctttaa 
tattggagca 
ggaatcgacg 
tag 



aagccaagac 
atataagaga 
attatatttc 
aagctcaaaa 
atttagcgta 
tcacaaaaga 
tagatgcaca 
ttctctggcg 
aaattattat 
tcgacggtaa 
gtaagaaaaa 



aattgagaaa 60 
tttaccgaaa 120 
tatacgcgga 180 
agtttacttg 24t) 
cctacttggg 300 
cgcagtcaaa • 360 
gcaagctcgt 4 20 
taaggtcaag 480 
tgaccgtgaa 540 
ttttaaaaaa 600 
gaaattacca 660 
693 



<210> 91 
<211> 981 
45 <212> DNA 

<213> Streptococcus pneumoniae 



<400> 91 
atgtttattt 

50 gcctttatcc 
aaacagcatc 
tctgttttgg 
atgattttgt 
gtctttcgta 

55 ggtggagtta 
tatcaagtgc 
tcaaacgcag 



ccatcagtgc 
aattttatag 
aggcaaaagc 
ttgctttctt 
tcatcttggt 
aaatcaatga 
tcttctatct 
atctagggat 
taaacttgac 



tggaattgtg 
aaaggcgcaa 
tgggactcct 
tttcgcccta 
cttgtatggc 
ggggcttaat 
tttctatgag 
tttctatatt 
agacggtgtt 



acatttttac 
attacaggcc 
acaatgggag 
tttagtagcc 
ttggtcggat 
cctaagcaaa 
cgcggtggcg 
gttttcgctc 
gacggtttag 



taactttagt 
agcagatgca 
gtttggtt-tt 
aattcagtaa 
ttttagatga 
aattagctct 
atatgctttc 
ttttctggct 
ctagtatttc 



aggaattccg 60 
tgaggatgtc 120 
cttgattact 180 
taatgtggga 240 



ctttctcaag 
tcagcttcta 
tgtctttggt 
agtcggtttt 
cgttgtgatt 



300 
360 
420 
480 
540 
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agtttgtctg cctatggagt tattgcctat 
attctggcca tgattggtgg tttgctcagt 
atctttatgg gtgatgtggg aagtttggct 
gctctccacc aagaatggac tctcttgatt 
5 tctgttatga tgcaagtcag ttatttcaaa 
acgcctgtac atcaccattt tgagcttggg 
gagtggaagg ttgacttctt cttttgggga 
gcaattttat atttgatgta a 

10 <210> 92 

<211> 2082 
<212> DNA 

<213> Streptococcus pneumoniae 

15 <400> 92 

atggcacgcg aattttcact tgaaaaaact 
gccggtaaaa caacaactac tgagcgtatt 
ggtgaaactc acgaaggtgc gtcacaaatg 
atcacgatca catctgctgc gacgacagct 

20 gacacaccag gacacgtgga cttcacaatc 
ggtgcggtta ccgttcttga ctcacaatca 
cgtcaagcaa ctgagtacgg agttccacgt 
ggtgctgact tcctttactc tgtaagcaca 
ccaatccaat tgccaatcgg ttctgaagat 

25 atgaaagctg aaatctatac taacgacctt 
gctgaatacc ttgaccaagc tcaagaatac 
actgacgaag aattgatgat gaaatacctc 
aaagctggta tccgtaaagc gactatcaac 
gccttcaaaa acaaaggtgt tcaattgatg 

30 ccacttgaca tcccagcaat caaaggtatt 
ccagcatctg acgaagagcc atttgcagct 
gtaggtcgtt tgacattctt ccgtgtttac 
ttgaatactt ctaaaggtaa acgtgaacgt 
agccgtcaag aaatcgacac tgtttactca 

35 gatactacaa ctggtgactc attgacagat 
aacgttccag aaccagttat ccaattgatg 
aagatgggta tcgcccttca aaaattggct 
aacgttgaaa ctggtgaaac agttatctca 
gttgatcgta tgcgtcgtga gttcaaagtt 

40 taccgtgaaa cattccgcgc ttctactcaa 
ggtaaaggtc aattcggtga tgtatggatt 
ttcgaattcg aaaacgcaat cgtcggt'ggt 
gaaaaaggtt tggtagaatc t'atggctaac 
gttaaagcta agctttatga tggttcatat 

45 aagattgcgg cttcactttc ccttaaagaa 
gaaccaatga tgcttgtaac aatcactgtt 
cacgtaactg ctcgtcgtgg acgtgtagat 
gttcgtgctt acgttccact tgctgaaatg 
tctcaaggac gtggtacatt catgatggta 

50 gtacaagaag aaattattaa gaaaaataaa 

<210> 93 
<211> 1227 
<212> DNA 
55 <213> Streptococcus pneumoniae 

<400> 93 



gtgcaaggtc agatggatat tcttctagtg 600 

ttcttcatct ttaaccataa gcctgctaag 660 

ttaggtggaa tgctggcagc tatctctatg 720 

atcggaattg tgtatgtttt tgaaacaact 780 

ctgacaggtg gtaaacgtat tttccgtatg 840 

ggattgtctg gtaaaggaaa tccttggagc 900 

gttgggcttc tagcaagtct cctgacccta 960 

981 



cgtaatatcg gtatcatggc tcacgtcgat 60 
ctttactaca ctggtaaaat ccacaaaatc 120 
gactggatgg agcaagagca agaacgtggt 180 
caatggaaca accaccgcgt aaacatcatc 240 
gaagtacaac gttctcttcg tgtattggat 300 
ggtgttgagc ctcaaactga aacagtttgg 360 
atcgtatttg ccaacaaaat ggacaaaatc 420 
cttcacgatc gtcttcaagc aaatgcacac 4 80 
gacttccgtg gtatcattga cttgatcaag 540 
ggtacggata tccttgaaga agacatccca 600 
cgtgaaaaat tgattgaagc agttgctgaa 660 
gaaggtgaag aaatcactaa cgaagaattg 720 
gttgaattct tcccagtatt gtgtggttca 780 
cttgatgcgg ttatcgacta ccttccaagc 840 
aacccagata cagacgctga agaaattcgt 900 
cttgccttca agatcatgac. tgacccattc 960 
tcaggtgttc ttcaatcagg ttcatacgta 1020 
atcggacgta tccttcaaat gcacgctaac 1080 
ggtgatatcg ctgctgccgt tggtttgaaa 1140 
gaaaaagcta aaatcatcct tgagtcaatc 1200 
gttgagccaa aatctaaagc tgaccaagac 1260 
gaagaagatc caacattccg cgttgaaaca 1320 
ggtatgggtg aacttcacct tgacgtcctt 1380 
gaagcgaacg taggtgctcc tcaagtatct 1440 
gcacgtggat tcttcaaacg tcagtctggt 1500 
gaatttactc caaacgaaga aggtaaagga 1560 
gtggttcctc gtgaatttat cccagcggtt 1620 
ggtgttcttg caggttaccc aatggttgac 1680 
cacgatgtcg actcatctga aactgccttc 1740 
gctgctaaat cagcacaacc agctatcctt 1800 
ccagaagaaa accttggtga tgttatgggt 1860 
ggtatggaag cacacggtaa cagccaaatc 1920 
ttcggttacg caacagttct tcgttctgca 1980 
tttgaccact acgaagatgt acctaagtca 2040 
ggtgaagact aa 2082 
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atgccaaatt acaWtattcc attttcaccg 
gtagcggata ccctgcgttc tggttggatc 
cgccgcttgt ctctttacac acagacacct 
gctctggagt tgattttacg cgttttggaa 
5 gccatgacct atacggcttc atgtagtgtc 
gtggatatcc aagcagatac gtttgagatg 
gagaaaacta aggtgattat cccagtagag 
ttgttccaag tcgtggagaa aaaacgtgac 
gcctttaacc gtattgtcat tgtctctgat 

10 ggacaacctt ctggttctat cgctgatttt 
tttacaacgg cagaaggtgg aagtgcgact 
gagatgtaca aggaattcca aatcctttcc 
aagatgcaac tggggtcatg ggaatacgat 
accgatatca tggcttcact tggtttggta 

15 cgccgtaagg acattgtgga ccgctatgat 
ttggcacaca agactgaaac tgtcgaatct 
ggagcaagcc tagaagaacg cagcctcatc 
agtaatgttc actacaaacc gcttcctctc 
atgacgaact atcctaaggc ctatgccttc 

20 actaaattaa gcgatgaaga agtagactat 
aaagtgctaa ctttatcaaa aaaatga 

<210> 94 
<211> 978 
25 <212> DNA 

<213> Streptococcus pneumoniae 



cctgatatca cagaagcaga aattgctgaa 60 
acaacaggtc ctaaaacaaa agaactggag 120 
aagactgttt gtctcaactc tgcgacagcc 180 
gtgggacctg gtgatgaagt catcgttcca 240 
attacgcacg tgggagcaac ccctgtcatg 300 
gactatgacc tgcttgagca agctatcact 360 
ctcgcaggga ttgtttgcga ttatgaccgt 420 
ttctttaccg cttcaagcaa gtggcaaaag 480 
agtgcccacg ctttgggatc tacttataaa 540 
acttccttct cattccatgc cgttaagaac 600 
tggaaagcca atccagtgat tgatgacgaa 660 
cttcacgggc aaactaagga tgctcttgcc 720 
atcgttacac cagcctataa gtgcaacatg 780 
caattggacc gctatccaag tttgttgcaa 840 
agtggttttg caggttctcg catccatcct 900 
tcacgccacc tctacatcac ccgtgtagaa 960 
atccaagaat tggctaaagc aggaattgca 1020 
ttgacagcct ataagaatct tggatttgat 1080 
tttgagaatg aaattaccct ccctcttcat 1140 
atcattgaga ctttcaaaac agtttctgaa 1200 

1227 



<400> 94 * . 

atgacagaac ctgatttttg gaacgataat attgcggccc aaaaaacgtc gcaagaattg 60 

30 ' aatgtattta aaaacactta caataccttc cataagatgg aagagttgca ggatgaagtc 120 
gaaattttat tggatttttt ggctgaagac gagtcagtgc atgatgaact ggtagcgcag 180 
ttagccgaac ttgataagat- aatgaccagc tacgagatga ctttactctt gtcagaacct 240 
tatgaccaca acaatgccat cttggaaatc catccaggtt ctggtggtac tgaggcgcag 300 
gactggggtg atatgttgct tcgtatgtat actcgttatg gtaatgctaa aggctttaaa 360 

35 gtggaagtgt tggattacca agcaggtgat gaggctggta ttaagtcggt aactttatca 420 
tttgaagggc ctaatgccta tggtctcctc aagtcagaaa tgggtgttca ccgcttagtg 480 
cgaatctcac catttgactc tgccaaacgt cgccatacct ctttcacatc tgtagaagtg 540 
atgccagaat tggatgatac tattgaagtg gaaatccgtg aagatgatat caagatggat 600 
accttccgtt caggtggtgc cggtggacaa aacgtcaata aagtttcaac aggtgtacgt 660 

40 ttaacccaca ttccaactgg aattgttgtc caatcaacag tggatcgtac ccagtatgga 720 
aatagagatc gtgccatgaa gatgttgcag gctaagctct atcaaatgga gcaagagaag 780 
aaggctgcgg aggtagattc tctcaaaggt gagaaaaagg agattacttg gggaagccaa 840 
atccgttctt atgtcttcac gccttatact atggtaaaag atcaccgaac tagctttgag 900 
gttgctcagg tagataaggt tatggatggg gacctagatg gttttatcga tgcttatctc 960 

45 aagtggcgaa ttagctaa 978 

<210> 95 
<211> 750 
<212> DNA 
50 <213> Streptococcus pneumoniae 

<400> 95 

atgttttata cttatttgcg tggattagtt gtattgctct tatggtccat caatggcaat 60 

gctcactatc ataatactga taaaattcct aatcaagatg aaaattatat tttagttgcg 120 

55 cctcaccgta cctggtggga tcctgtttat atggcctttg cgaccaagcc aaaacagttc 180 

atctttatgg caaaaaaaga actctttacc aaccgtatct ttggttggtg gattcgtatg 240 

tgtggcgcct ttcccatcga ccgtgaaaat cccagcgcct cagccatcaa atatcctatc 300 
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aacgttctca aaaaaagtga ccgctctctc atcatgtttc caagtggtag ccgccactca 360 

aacgatgtca aggggggcgc agcactgatt gccaaaatgg ccaaggtccg tatcatgccg 420 

gttacctaca ccggtcccat gactttgaag ggcttgatta gccgtgaacg tgtcgatatg 480 

aactttggaa atccaatcga tatctcagat atcaagaaaa tgaatgatga aggcattgaa 540 

acagtcgcca atcgtattca aacagaattc caacgtctgg acgaagaaac gaaacaatgg 600 

cacaatgata aaaaaccaaa tccactctgg tggtttatcc gcatccctgc cctcatcctt 660 

gctattatcc tcgctatcct aaccatcatc tttagcttta tcgcaagctt catctggaac 720 
ccagataaga aaagagaaga acttgcatag 750 

<210> 96 
<211> 3102 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 96 

ttgatcgcac aactagatac aaaaacagtc tatagtttta tggaaagcgt catttcgatc 60 
gaaaagtatg tgagagcagc taaagaatac ggctacactc atttggctat gatggatatt 120 
gacaatcttt atggcgcttt cgactttcta gagattacaa aaaaatacgg cattcatcct 180 
ttgctagggc ttgaaatgac agtgtttgta gatgatcagg gagtaaattt gcgcttttta 240 
gctctatcta gtgtgggcta tcagcagttg atgaagcttt cgacagccaa gatgcagggg 300 
gagaaaactt ggtcagtcct gtcccagtac ctggaggata tcgcggtcat tgtgccttat 360 
tttgatagag ttgagtcgtt agaactaggc tgtgattact atataggggt ttatccagaa 420 
acactagcaa gcgaatttca tcatcctatc ttacctcttt atcgggtcaa cgcttttgaa 480 
agcagggata gagaagttct tcaagtttta acagcgatta aagaaaatct accgctcaga 540 
gaagttccct tgcgttcgag acaagatgtc tttatatcag caagttcttt agagaaacta 600 
ttccaagagc gttttccgca agctttggac aatttagaaa agcttatttc aggcatttct 660 
tacgacttgg atactagtct gaaactgcct cgttttaatc cagctagacc agcagtagag 720 
gagttgagag agcgtgctga actggggctt gttcagaagg ggttgactag taaagaatat 780 
caagatagac tagaccaaga att'gtctgtt attcatgata tgggctttga tgattatttc 840 
ttggttgttt gggatttgtt gcgttttgga cgatcgaatg gctattatat gggaatggga 900 
aggggttctg cagtaggcag tttggtttct tatgccttag acatcacggg gattgaccca 960 
gtagagaaaa atctgatttt tgaacgcttt cttaatcgtg aacgctatac catgcctgat 1020 
attgatattg atatcccaga tatttatcgt ccagatttta tcagatatgt tggtaataaa 1080 
tatggtagta aacatgcggc acaaatcgtt actttttcaa cctttggagc caagcaagct 1140 
cttcgagatg tcttgaaacg ctttggtgtg ccagagtatg aattatctgc aattactaag 1200 
aaaatcagtt ttcgtgacaa tcttaagtcg gcctatgagg gcaatctcca gtttcgtcag 1260 
caaatcaata gtaagttaga ataccaaaaa gcttttgaga ttgcttgcaa gatagagggc 1320 
tatccaaggc aaacctctgt ccatgcggct ggtgttgtaa ttagtgacca agatttaacc 1380 
aactacattc ctctaaagta tggtgatgaa attccactga ctcagtatga tgctcatgga 1440 
gttgaggcta gcggactttt gaagatggac tttctgggac tacgaaattt gacctttgtc 1500 
cagaagatgc aagagttgct tgctgaaata gaaggtattc accttaaaat tgaagaaata 1560 
gatttggaag acaaagaaac gttagattta tttgcctctg gaaatacaaa aggtatcttt 1620 
caatttgagc aacctggtgc tattcgcttg ctcaaacgtg ttcaaccagt ctgttttgaa 1680 
gatgtcgtag caactacttc tctaaatcga ccgggtgcta gtgactatat caataatttt 1740 
gtggcaagaa agcatgggca ggaagaagtg actgttctgg atccagtact ggaggatatt 1800 
ttggctccaa cctacggcat aatgctctat caggagcagg ttatgcaggt tgcccagcga 1860 
tttgccggat ttagtcttgg gaaagccgat attttgcgtc gggctatggg gaaaaaggat 1920 
gcctctgcca tgcatgagat gagggcttcc tttattcaag gttcattaga agctggtcat 1980 
actgtggaaa aagcagagca ggtctttgat gttatggaga agtttgcagg ctatggtttt 2040 
aacaggtcac acgcctatgc ctactcagca ttggccttcc agttggctta ttttaaaaca 2100 
cattatccag ccatatttta tcagatcatg ttgaattctg ccaacagtga ttacttaata 2160 
gatgcacttg aagcaggttt tgaagtggcg cctctgtcca tcaacacgat tccctatcac 2220 
gataaaattg ccaacaaggc catctatcta ggtttgaaat ccattaaagg agtcagtaat 2280 
gatttagctc tctggattat tgaacataga ccttattcta atattgaaga ttttatagct 2340. 
aaattacctg agaattatct gaaacttcct ctgctagaac ctttggtaaa agttggtctt 2400 
ttcgattcat ttgaaaaaaa tcgtcaaaaa gtatttaata acttagctaa tctatttgaa 2460 
tttgtgaaag agttgggaag tttgtttgga gatgctattt atagttggca ggaatcggaa 2520 
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gattggacgg aacaagaaaa attttatatg 
aaacatccac tacaagctat tgcaagtaag 
ttgtcagaaa atagctatgc tattattttg 
accaaaaagg gtgaaaatat ggccttctta 
5 gtcactctct tttcagactt atatcgtcag 
tactatgtaa aaggaaaaat acaatcacgt 
ataagagaag cagttgctga acgcttttgg 
gaaatttcac gtattttaga acaatttaaa 
gaggaacaga aaaccatcgt ttctccccat 
10 gagaaattga atgaaatcgt tatgaaaacg 

<210> 97 

<211> 921 

<212> DNA 

15 <213> Streptococcus pneumoniae 



gaacaagagc ttttagggat aggtgtcagc 2580 
gctatttacc cgattacccc aatcggaaat 2640 
gttgaagttc agaaaataaa agtgattcgt 2700 
caggcagatg atagtaagaa aaaattggat 2760 
gttggacagg aaataaaaga gggagccttc 2820 
gatggccgtc tgcaaatgat tgcacaagaa 2880 
atacaggtga aaaatcatga atcggatcaa 2940 
ggcccaatcc cagtcatcat ■ ccggtatgaa 3000 
cattttgtag ctaaatccaa tgaattagag 3060 
atttatcgct aa 3102 



<400> 97 

atgaccaacg 

cgaaagacaa 

20 caaatcagtc 
tacaagcgaa 
aaatctcaac 
agtcgcctac 
acaactgatg 

25 aaaggatttc 
aatggacaag 
aagaaacaaa 
aatccacaaa 
gctggagtgg 

30 gcccaaaacg 
gagtcctttg 
gaaccaacca 
gaaatttact 



35 



40 



aatttttaca 
cacctccttt 
tccaagacgt 
ctaaggaaga 
cttttattat 
ttcaaatcct 
gttttctcta 
ctgaaagcta 
atgtagatat 
gtgtcaaagc 
acgatcgtct 
atgatattga 
accctgatag 
cccatcaggt 
gaaatcgtgc 
taaaaaagta 



ttttgaaaaa 
gacagaagaa 
tacagatatc 
tttagccttt 
tggggtttct 
actgtcccgt 
tcccaatcaa 
tgatatggaa 
tcctgtctat 
tgctgatttt 
ctatatcact 
aagttggtat 
ctactattat 
ctggaccagt 
agaagtgatt 



atcagccgcc 
gaattggaat 
tatctcccct 
tcaaaaggaa 
gggagtgttg 
acgtttacag 
accttgattg 
gctcttctca 
tctcatgaag 
gtaatcgttg 
gacttctttg 
ctggaccgtt 
cgttttactc 
atcaatctca 
cttcataaaa 



agacttggca 
ctatcaagag 
tggctcatct 
ttttcctcca 
ccgttggaaa 
atgctacggt 
agcagggaat 
acttcttgga 
tttacgacat 
agggaattaa 
acttttccat 
tcttgaaaat 
agatgccgat 
caaatctgca 
gcaagaacca 



atctttacat 60 
ttttaatgac 120 
gattcagatt 180 
acgtgaaagt 240 
atccacaacc 300 
tgagttggtt 360 
tttaaatcgt 420 
ccgcatcaaa 480 
cgtacccaaa 54 0 
tgtctttcaa 600 
ctatgtagat 660 
gctgagtcta 720 
tggggaagtg 780 
aaattatatt 840 
tgaaatcgat 900 
921 



<210> 98 
<211> 741 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 98 



atggaaattt 
aaccactatg 
catcgtgcag 
gatacccaga 

45 gaaaatcaaa 
ttggaagtcc 
atcggcttga 
gaattgctca 
attatcaccc 

50 cttgagtcag 
agtgagattc 
gttcgttttg 
aacgaggagg 



cattattaac 
tcaatagagc 
ggaatatcgc 
tcgatacagt 
agattcacca 
ttgctattat 
ttcgtggaga 
aggctggtca 
agtctattgg 
gtgactatct 
gtgatattgt 
ctaacaatgc 
atgaagaatg 



agatgttggt 
tggacgtacc 
tagt'gaaatg 
caatgaagtg 
gcttggtcag 
tgataatcag 
agaataccat 
attgacacca 
gcaaaaagat 
cttgctcaat 
aaccagtgat 
aggaggttta 



cagaaacgaa 
atgattattt 
gcggtcacag 
cgtgaatggt 
gatgaagctt 
gctatctatg 
cagttgacga 
gaagaggcag 
gaaattcagc 
agtgacggct 
attcctttag 
gacaacatta 



caaataacca 
tagctgatgg 
acctgggtgt 
tcgcccatta 
acagaggcat 
ctcatattgg 
gcgatcattc 
aagctcatcc 
ctgattttgg 
tgaccaacat 
cagataaaac 
cggttgccct 



agactatgtc 60 
gatgggaggt 120 
agcttgggtt 180 
cctagaaatt 240 
gggaactact 300 
tgattcgcgt 360 
cttggttaat -420 
gcaaaaaaat 480 
gacagttatc 540 
gatttcaggc 600 
ggagacactt 660 
tgtttctatg 720 
741 



55 



<210> 99 
<211> 831 
<212> DNA 
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<213> Streptococcus pneumoniae 



<400> 99 

gtgacgatac agatgaagaa tacaggtaaa 
5 cagagtcaaa gggtcttgta tgaattgcga 
aatgatacca atccggatat tgtcatttcc 
tttcataagt acgaaaatca gcttgacaag 
ttgggcttct atacagatta tcgtgatttt 
ctagatactg gggcaagggt ttcttaccct 

10 ggtgaagtta agattttcag agcactcaac 
atggtggcag atattgtaat aaatggtgtt 
acagtttcga caccgactgg tagtactgcc 
caccctacca ttgaagcttt gcaattaacg 
cgaacactgg gctcttccat tattgtgcct 

15 agaaacgatt atcatactat ttcggttgac 
cgtattgagt atcaaatcga ccatcataag 
agtttctgga accgtg.ttaa ggatgccttt 

<210> 100 

20 <211> 1623 

<212> DNA 

<213> Streptococcus pneumoniae 



cgaattgatc tgatagccaa tagaaaaccg 60 
gatcgtttga agagaaatca gtttatactc 120 
attggcgggg atggtatgct cttgtcggcc 180 
gtccgcttta tcggtcttca tactggacat 240 
gagttggaca agctagtgac taatttgcag 300 
gttctgaatg tgaaggtctt tcttgaaaat 360 
gaagccagca tccgcaggtc tgatcgaacc 420 
ccctttgaac gttttcgtgg agacgggcta 480 
tataacaagt ctcttggcgg tgctgtttta 540 
gaaattgcca gccttaataa tcgtgtctat 600 
aagaaggata agattgaact tattccaaca 660 
aatagcgttt attctttccg taatattgag 720 
attcactttg tcgcgactcc tagccatacc 780 
atcggtgagg tggatgaatg a 831 



<400> 100 

25 atgtcaaaag aaattaaatt ttcatcagat 
attcttgcag acactgttaa agtaaccttg 
aagtcattcg gttcaccctt gattaccaat 
ttggaagacc attttgaaaa tatgggtgct 
aatgatatcg caggtgacgg aactacgact 

30 gaaggaatca aaaacgtcac agcaggtgca 
acagcagttg ccgcagcagt tgaagctttg 
gaagctatcg ctcaagttgc agcagtatct 
tctgaagcaa tggaaaaagt tggcaaagac 
atggaaacag agcttgaagt cgtagaagga 

35 tacatggtga cagatagcga aaaaatggtg 
acagacaaga aaatttccaa catccaagaa 
agcaatcgtc cactcttgat tattgcggat 
gttttgaaca agattcgtgg aaccttcaac 
gaccgtcgca aagccatgct tgaagatatc 

40 gaagaccttg gtcttgagtt gaaagatgcg 
gtgaccgtgg acaaagatag cacggttatt 
tctcaccgtg ttgcggttat caagtctcaa 
gaaaaattgc aagaacgctt ggccaaattg 
gccgcaactg aaactgagtt gaaagaaatg 

45 actcgtgcag ctgttgaaga aggtattgtt 
attccagctg ttgctacctt ggaattgaca 
ctccgtgctt tggaagaacc cgttcgtcaa 
atcgttatcg atcgtttgaa aaatgctgag 
gagtgggtta acatgattga tcaaggtatc 

50 ctacaaaatg cagcatctgt agccagcttg 
aaaccagaac cagtagcccc agctccagca 
taa 

. <210> 101 
55 <211> 1446 
<212> DNA 

<213> Streptococcus pneumoniae 



gcccgttcag ccatggttcg tggtgtcgat 60 
ggaccaaaag gtcgcaatgt cgttcttgaa 120 
gacggtgtga ccattgccaa agaaatcgaa 180 
aagttagtat cagaagtagc ttctaaaacc 240 
gcaacagtct tgacccaagc tatcgtccgt 300 
aatccaatcg gtattcgtcg tgggattgaa 360 
aaaaacaacg ccatccctgt tgccaataaa 420 
tctcgttctg aaaaagttgg tgagtacatc 480 
ggtgtcatca ccatcgaaga gtcacgtggt 540 
atgcagtttg accgtggtta cctttcacag 600 
gctgaccttg aaaatccgta cattttgatt 660 
atcttgccac ttttggaaag cattctccaa 720 
gatgtggatg gtgaggctct tccaactctt 780 
gtagtagcag tcaaggcacc tggttttggt 840 
gccatcttaa caggcggaac agttatcaca 900 
acaattgaag ctcttggtca agcagcgaga 960 
gtagaaggtg caggaaatcc tgaagcgatt 1020 
atcgaaacta caacttctga atttgaccgt 1080 
tcaggtggtg tagcggttat taaggtcgga 1140 
aaactccgca ttgaagatgc cctcaacgct 1200 
gcaggtggtg gaacagctct tgccaatgtg 1260 
ggagatgaag caacaggacg taatattgtt 1320 
attgctcaca atgcaggatt tgaaggatct 1380 
cttggtatag gattcaacgc agcaactggc 14 40 
attgatccag ttaaagtgag tcgttcagcc 1500 
attttgacaa cagaagcagt cgtagccaat 1560 
atggatccaa gtatgatggg cgggatgatg 1620 

1623 
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<400> 101 

atgattaaga ttgaaaccgt attagatatt 
attgaccaag gtcattacca ctacaactac 
5 gacagccgaa aagtaacaga agacactctt 
gaataccttc tttctgctat aacacaaggt 
gaagtcgata tccctgtcat cattgtgaac 
atggagttct atggtaatcc acaagagaaa 
ggtaagacaa cagcaaccta tttcgcctat 

10 atgttgtcga ccatgaacac aactcttgat 
acccctgaga gtattgacct ctttgacatg 
cacctcatca tggaagtctc cagtcaagcc 
tttgatgtag gagtctttct taacatcact 
agctttgaag actatttcta ccacaagcgt 

15 attaacagtg acatggacca cttctcagtc 
gatttctatg gtagccaatt. tgataaccaa 
gctacgggta aactcgctgg agattatgat 
aatgcagttg ctgctggact tgcttgtctc 
aaaggcatcg ctgcaacccg cgttcctggt 

20 gccaaggtct tcatcgacta tgcccacaat 
gttgaaactc atcaaaccgg aaagattgct 
gaaagtcgtc gtaaggactt tggcctcctc 
ctgactgctg atgaccctaa ctatgaagac 
tacatcaatc atcctgttga aaagattgcg 

25 gctatcacaa atcacgaatt agatgcagtt 
caaatcatcc agggcaagaa agaatcctac 
ttataa 

<210> 102 
30 <211> 1980 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 102 

35 atgatccaaa tcggcaagat ttttgccgga 
ggaggtatgg cggatgtcta cctagccaaa 
gtgaaggttc tgaggaccaa ctaccagacg 
gaagcgagag ctatggcaga tctagaccat 
gaggaagacg gtcaacagta cctagctatg 

40 tatatcaagg aacattatcc tctttctaat 
ctcttggcta tgcgcttggc ccatactcga 
aatatcctct tgacaccaga tgggactgcc 
tttgcagaga caagtctgac ccagactaac 
ccagagcagg cgcgtggttc gaaggcgact 

45 attttctatg agatgctgac aggccatatc 
gccctccagc atttccagaa acccctgccg 
caggctttag aaaatgttat tatcaaggca 
tcggtttcag agatgtatgt ggacttgtct 
agtaagttaa tctttgatga aacgagcaag 

50 cagagtacct tgacatctat tcctaaggtt 
aacccaagcc aggctgtgac agaggaaact 
tttaagatgc gttacctgat tttgttggcc 
tggatactat ccagaactcc tgcaaccatt 
gcagaggcca aggcaacgct caaaaaagcc 

55 gctagtgaaa aggtggaaga agggcggatt 
cgaaaagaag gaacgaaaat caatttggtt 
agtaattatg tcggtcggaa atcctctgat 



ttaaagaaag atggcctttt tcgcgaaatt 60 
agcaaagtta tttttgatag catcagctac 120 
ttttttgcaa aaggcgctgc ctttaaaaaa 180 
ttagcttggt atgtagctga aaaggactac 240 
gatataaaga aagccatgag tttgattgcc 300 
ctcaaactcc ttgcctttac tggtactaag 360 
aacatcttat ctcaggggca tagacctgct 420 
ggcgagactt tctttaagtc agcgttgaca 480 
atgaatcagg ctgtgctaaa tgaccgtacc 540 
tatctagtcc atcgagtcta tggactgacc 600 
cctgaccata tcggcccgat tgaacaccct 660 
ctcttgatgg aaaatagccg agcagtcatc 720 
ttgaaagaac aggttgaaga tcaagaccat 780 
atcgagaatt ccaaagcctt tagcttttca 840 
atccaactca ttggcaactt caaccaagaa 900 
cgtctcggag caagtcttga ggacatcaaa 960 
cgtatggaag tcctcactca gaaaaatgga 1020 
ggggatagtc tgaaaaaact catcaatgtg 1080 
ctggttctgg gatcaacagg aaacaaggga 1140 
ctcaatcaac accctgagat tcaagtcttt 1200 
ccaatggcca ttgcagatga aattagtagc 1260 
gatcgccaag aagccatcaa ggcggcaatg 1320 
attattgcgg gtaagggagc cgattgttac 1380 
ccaggagata cagccgtcgc agaaaattat 1440 

1446 



cgctatcgga ttgtcaaaca gattggtcga 60 
gacttaatct tagatgggga agaagtggca 120 
gacccgatag ctgtagctcg ttttcagcgt 180 
cctcatatcg ttcggataac agatattggc 240 
gagtatgtgg ctggactgga cctcaaacgc 300 
gaagaagcag tccgtatcat gggacaaatt 360 
ggaattgttc acagggactt gaaacctcaa 420 
aaggtcacag actttgggat tgctgtagcc 480 
tcgatgttgg gctcagttca ttacttgtca 54 0 
gtgcagagtg atatctatgc catggggatt 600 
ccttatgacg gggatagcgc ggtgaccatt 660 
tccgttattg cagaaaatcc atctgtacct 720 
actgctaaaa agttgaccaa tcgctaccgc 780 
agtagcttgt cctacaatcg tagaaatgaa 840 
gcagatacca agaccttgcc gaaggtttct 900 
caagcgcaaa cagaacacaa atcaatcaaa 960 
taccaaccac aagcaccgaa aaaacataga 1020 
agccttgtat tggtggcagc ttctcttatt 1080 
gccattccag atgtggcagg tcagacagtt 114Q 
aattttgaga ttggtgagga gaagacagag 1200 
atccgtacag atcctggcgc tggaactggt 1260 
gtctcatcag gcaagcaatc tttccaaatt 1320 
gtcattgcgg aattaaaaga gaaaaaagtt 1380 
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ccagataatt tgattaaaat tgaggaagaa 
ctgaagcaaa gtctaccaga aggtacgacc 
ttgacagtag ctaaaaaagc tacgacgatt 
acagaagtaa tctcagaact caagcagaag 
5 gaagaagagt ccagcgaaag cgaaccagga 
acgacttatg atgtgagtaa acctactcaa 
agtgttgcca tgccgagtta cattggttct 
caaattgttg ggattaagga agctaatata 
ggtagtgcag aaggcatggt tgttgaacaa 
10 aataagacta gagtcaagat ttcaatctac 

<210> 103 

<211> 1176 

<212> DNA 

15 <213> Streptococcus pneumoniae 



PCT/US00/35604 

gagtcgaatg agagtgaggc tggaacggtc 1440 

tatgacttga gcaaggcaac tcaaattgtt 1500 

caattaggga actatattgg acggaactct 1560 

aaggttcctg agaatttgat taagatagag 1620 

acgattatga aacaaagtcc aggtgccgga 1680 

attgtcttga cagtagctaa aaaagttaca 1740 

agcttggagt ttactaagaa caatttgatt 1800 

gaagttgtag aagtgacgac agcgcctgca 18 60 

agtcctagag caggtgaaaa ggtagacctc 1920 

aaacctaaaa caacttcagc tactccttaa 1980 



<400> 103 

atgaaacatt 

tccagtaact 

20 aaattagctg 
ctgctagctg 
aatcatgaca 
ggacgcgtct 
atcactgaac 

25 gatgaccagt 
acaacaggtg 
cgccatttta 
tttccacata 
catgtcatca 
.30 cgcatgtcta 
tctgagaagg 
ttaaaaacct 
gtcaaacagc 
attcctgtaa 

35 ctcaaggaaa 
ggcgaagtta 
accggctggg 



ttgatactat 
tttatggaca 
ggactggtgg 
gaattcctgg 
tcatcaactt 
ttccagccag 
taggtggtca 
ttgtccttaa 
gtaagtctta 
agcataccat 
aagccttaca 
ctcatgattt 
gctttgtcaa 
acttggttac 
tgttaccaga 
tgactgaaaa 
ctggaaaaat 
tcaatcctaa 
tggatatcaa 
tggcgggaag 



tgtcatcggt 
gaaaaccctc 
gggacgttgc 
aaacggacgc 
ttttacagaa 
tgacaagtct 
agttgctact 
gtcagcggat 
tccttcgact 
caccgatctt 
agggatttct 
actctttacc 
aggtggggag 
atttctagaa 
acgcttggcc 
ggaacgagaa 
gtcccttgca 
aacccttgaa 
tgcccacacg 
tctgcattat 



gggggacctg 
ctcatcgaaa 
aatgtgacca 
tttctttaca 
aatggtgtta 
cggactatta 
caaatagaaa 
caaaccttca 
ggttcgactg 
gaggctgctg 
ctggacgatg 
cactttggtt 
gttctctcac 
gaaaatcggg 
gaattttttg 
caacttgtcc 
aagtcctttg 
agtaagctgg 
ggtggcttta 
gattaa 



ctggtatgat 
aaaatcggaa 
acaatggtag 
gtgttttctc 
aacttaaggt 
tcgaagcttt 
tcgtttctgt 
cttgtgagaa 
gttttggtca 
aaagtccttt 
tgaccctaag 
tgtcaggtcc 
tcgatgtttt 
aaaaatcctt 
tacaaggata 
agtccattaa 
ttaccaaggg 
tacctggcct 
acatcacttc 



ggctacgatt 
acttggaaaa 
cttagacaac 
ccagttcgat 
cgaagaccac 
ggaaaagaaa 
taaaaaagta 
actcattgtc 
cgagattgct 
attaacagat 
ttatggtaag 
tgctgcccta 
gcctcaactt 
gaaaaacgct 
tcctgaaaaa 
agaacttaaa 
tggagtcagt 
ccactttgca 
tgccctctgt 



60 

120 

180 

240 

300 

360 

420 

4 80 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1176 



<210> 104 
40 <211> 696 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 104 
45 , atgctgaaat 
cagcttgtct 
gccttggtgc 
aagttggata 
cggttcaaga 
50 gtgacttctg 
ttggacgaac 
cgacctgaag 
ttgcaagcag 
agtcaaatga 
55 gaaaagatgc 
atcatgtttc 



gggaagactt 
ctaaaaggaa 
ttacatgggt 
gcaaggggcc 
tttggaagtt 
ctaacgatag 
tgcctcagtt 
tgccacgtta 
gaattacctc 
cggagaaagg 
gctataacct 
aaaccgtgtt 



gcctgtggaa 
gggttcgctg 
tctaacttct 
agtgatttac 
ccgtaccatg 
ccgtattacc 
ggtcaatgtc 
tacagagcag 
tccagccagc 
tctgtcagtt 
cgcctatctc 
tgaggtacta 



atgaaatcaa 
attttcaagc 
cccatctttc 
aagcaagagc 
gtgacggatg 
aaggttggaa 
cttaaaggtg 
tatagccctg 
atcaactaca 
gatcaggcct 
cgagagttta 
aaataa 



gcgaggttga 
gttgcttgga 
tcatcttgag 
gtgtgaccca 
cggataaaaa 
atttcatccg 
agatgtcctt 
aaatgatggc 
aggatgagga 
atgtggagca 
gtttctttgg 



gtcttactac 60 
ctgggttttg 120 
catttggatc 180 
gtacaaccgt 240 
aggaagtctg 300 
acgtgtccgt 360 
tgtcggtaca 420 
aaccttgctc 480 
caccatcatc 540 
tgttcttcct 600 
ggacatcaaa 660 
696 
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<210> 105 
<211> 423 
<212> DNA 

<213> Streptococcus pneumoniae 

5 

<400> 105 

atgactagtc cactattaga atctagacgc 
atgagccttg agttcggtac ggatgtcgaa 
cgtgaatata cggatgtaca acttccagcc 
10 gctaaaaagg aagaactaga taagcaaatc 
gaacgcttaa cgctcgtgga gagaaacctc 
tttgacactc ctcagctggt tgctgttaat 
gatcaaaaat ctgcccgttt tatcaatgga 
taa 

15 

<210> 106 
<211> 3540 
<212> DNA 

<213> Streptococcus pneumoniae 

20 

<400> 106 

atgtatttaa aggaaatcga aattcagggg 
gtttttgacc aaggtgtgac ggcagttgtt 
acagaaagtc tgcgttgggc tttgggggag 

25 atgccggatg tcatctttgc tggaacagaa 
gttgtgactc tggataatca tgacggattt 
gaacgccata tctatcgtag tggagatagc 
ctgcgtgata ■ ttcatgacct cttcttggat 
atttcccaag ggaaggttga ggagattttt 

30 tttgaagaag ctgctggagt tttaaaatac 
ctgcagcaaa ctcaggataa tctggaccgc 
caaatcaagc ctcttgagaa gcaagctgag 
caacgtaagg ctatttattt agacgttctg 
ctagagtcga cagaagaaga gttggctcag 

35 aagcgtgaaa aattagaaga agaaaatcaa 
gctgaaatgg ccaaagacca aggcagtttg 
gaaagaaaat tagccctatc gaaactggag 
gcacaagctc gtttggctgc tttggaggat 
gataaagaaa gctctttagc tctgttagag 

40 aatcgtttag aagctgaatt gctggctttc 
ttacgtgaac gctttgtagc tcttttacaa 
cgtattgaga atgagttgga aaatagtcgt 
gaaaagctga aagagcaatt agctacagct 
cttgaaactg ccaaggtgca ggttcagaaa 

45 gagcaagagg agcagaaaac ttcctatcaa 
gatagtctca aaaacaagca ggccagagct 
agtaactttt atgcaggtgt taagagtgtt 
attggtgcag tcagtgagca tctgaccttt 
gccttagggg caagtagcca gcatatcatc 

50 attgatttcc tcaaacgaaa cagagtcggt 
aaggcgcgta cgatttctag tcagaaccaa 
gggatggcag atgagttggt gacttttgat 
ctagctacga cggctatctt tgataccgta 
cgttatcagg ttcgtatggt gacattggat 

55 gcgggtggtg ccaatcgcca aaataacagt 
caaaaagaaa ttgctgcaga tgaagcaagc 
ttgcaagacc agatggctgc attgacagaa 



caactccgta aatgcgcttt tcaagctctc 60 

actgcttgtc gtttcgccta tactcatgat 120 

tttttgatag acctcgtttc tggtgttcaa 180 

actcagcatt taaaagcagg ttggaccatt 240 

cttcgcttgg gagtctttga aatcacttca 300 

gaagctatcg agcttgcaaa ggacttctcc 360 

ctgctcagcc agtttgtaac agaagaacaa 420 

423 



ttcaagtctt ttgctga'taa gaccaaggtc 60 
ggacccaatg gatctggaaa gtccaatatt 120 
tctagtgtca agagtctccg tgggggcaag 180 
agtcgcaaac cgctcaatta tgcttctgta 240 
atcaaggatg caggtcaaga aatcagggtg 300 
gaatacaaga ttgacggcaa gaaagtccgt 360 
actggattgg gacgagattc cttctctatt 420 
aattccaagc ctgaggaacg acgagctatt 480 
aagactcgca gaaaagaaac cgagagtaaa 540 
ttagaggaca ttatctacga gttggataat 600 
aatgcccgta agtttttaga cttggaagga 660 
gttgctcaaa tcaaggaaaa taaggcagaa 720 
gttcaagaac tcttgatgag ttattaccaa 780 
actcttaaaa agcaacgcca agatttacag 840 
atggacttga ctagtctgat tagtgattta 900 
tccgagcaag tggccctgaa tcaacaggag 960 
aagagaaatt cactcagcaa agaaaagtat 1020 
ggaaatctag tccaaaataa tcaaaaactc 1080 
tcagacgatc ctgatcagat gattgagctc 1140 
gaagaagcgg atgtctcaaa ccagttgacc 1200 
cagctttctc aaaaacaagc agatcaacta 1260 
aaagagaagg ctagtcagca aaaagacgag 1320 
ttattggctg actatcaagc tattgccaag 1380 
gctcaacaaa gtcaactctt tgaccgtctg 1440 
caaagtttgg aaaatatcct gagaaatcat 1500 
ctccaagaaa aagatcgcct aggtgggatt 1560 
gatgtttatt atcaaactgc cctagagatt 1620 
gtagaagatg aagagtcggc aaccaaagct 1680 
cgtgcaacct ttcttccttt gaccactatt 1740 
gatgctatcg ctgtaagccc aggtttcctt 1800 
actagactgg aagccatttt caagaacttg 1860 
gaacatgcgc gtgaagctgc tcgacaagtt 1920 
gggacagaat tacgcacggg tggttcctat 1980 
attttcatca agccagaact ggagcaatta 204 0 
ttgggttcag aagaagcggc tttgaagacc 2100 
agattagaag ccatcaaatc tcaaggagag 2160 
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caggcacgta ttcaggagca aggcttgtcc ctcgcttatc agcaaactag tcagcaagtt 2220 
gaagaactgg aaactctttg gaaactccaa gaagaggaaa tagatcgtct ttctgaggga 2280 
gattggcaag cggataagga aaaatgtcaa gagagccttg ctactatcgc cagtgacaag 2340 
caaaatctgg aagctgagat tgaagaaatt aagtctaata aaaacgccat ccaagaacgc 2400 
5 tatcaaaatt tgcaggaaga ggtggcgcaa gctcgcttgc ttaagacaaa actgcaaggg 24 60 
caaaaacgtt atgaagtagc tgatattgag cgtttaggca aggaattgga caatcttaat 2520 
atcgaacaag aagaaattca gcgcatgctc caagaaaaag ttgacaatct tgagaaggtt 2580 
gatacagaat. tgctcagtca acaggcggaa gaatccaaaa ctcagaaaac aaatctccaa 264 0 
caaggtttga ttcgcaagca gtttgagttg gatgatatag aaggtcaact ggatgatatt 2700 

10 gccagtcact tggatcaagc tcgccagcag aatgaggagt ggattcgcaa gcaaacacgt 27 60 
gctgaagcca agaaagaaaa ggtcagcgag cgcttgcgcc atctacaaaa tcaattaaca 2820 
gaccagtacc agattagcta tactgaagca ctagaaaagg cacatgaatt ggaaaacctc 2880 
aatctggcag agcaagaggt gcaggattta gagaaggcta ttcgctcatt gggacctgtc 294 0 
aacttggaag ctattgacca gtacgaagaa gttcacaacc' gtctggactt tctaaatagt 3000 

15 cagcgagatg atattttgtc agcgaaaaat ctgctccttg aaaccattac agagatgaat 3060 
gatgaggtca aggaacgctt taaatcaacc tttgaagcta ttcgtgagtc ctttaaagtg 3120 
acctt.caagc agatgtttgg cggaggtcag gcagacttga tattgactga gggcgacctt 3180 
ttaacagctg gtgtggagat ttctgttcaa cctccaggta agaaaatcca gtcgcttaac 3240 
ctcatgagtg gtggtgaaaa agctctatcg gctcttgcct tgcttttctc cattattcgt 3300 

20 gtcaagacca ttccttttgt catcttggat gaggtggaag ctgcgctgga tgaagccaat " 3360 
gttaaacgtt ttggggatta cctcaaccgc tttgacaagg acagccagtt tatcgtcgta 3420 
acccaccgta agggaaccat ggcagcggct gattccatct atggagtgac catgcaagaa 34 80 
tcaggtgtct caaaaattgt ttcggttaag ttaaaagatt tagaaagtat tgaaggatga 3540 

25 <210> 107 
<211> 1344 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 107 

atgacaaaac gtgtaacgat tattgacgta aaagactatg ttggtcagga agtgacgatt 60 
ggcgcttggg ttgccaacaa atcaggaaaa ggaaaaatcg ctttcttaca attgcgtgat 120 
ggaacagcct tctttcaagg tgtggctttt aaaccaaact ttgtcgaaaa atttggtgaa 180 
gaagtgggac ttgagaagtt tgatgttatc aaacgcttga gccaagaaac gtctgtttat 240 
gtgacaggta ttgtcaaaga ggacgaacgt tctaaatttg gctatgagtt ggacatcaca 300 
gacatcgaag tgatcggtga atctcaagac tacccaatca caccaaaaga acacggaaca 360 
gactttttga tggataaccg tcacttgtgg ctacgctctc gtaagcaagt agctgtgttg 420 
caaatccgta acgctattat ctatgcaact tatgagttct ttgacaagaa cggttttatg 480 
aagtttgaca gcccaattct ttcaggaaat gcggcagaag attctacaga actctttgaa 540 
actgactact tcggaacgcc agcctacttg- agccaatcag gtcagcttta cctagaagca 600 
ggggctatgg ctcttggtcg tgtctt'tgac tttggtccag ttttccgtgc tgaaaaatca 660 
aaaacacgcc gtcacttgac tgagttctgg atgatggatg ctgagtactc atacttgaca 720 
catgatgagt cgcttgactt gcaagaagct tatgtgaaag ctcttctaca aggtgttctt 780 
gaccgcgcgc ctcaagcctt ggaaaccttg gaacgtgata cagaactctt gaaacgctac 840 
attgcagagc cattcaaacg tatcacttac gatcaagcca ttgacctctt gcaagagcat 900 
gaaaatgatg aagatgctga ctacgagcat cttgagcatg gtgatgactt tgggtcacca 960 
cacgaaactt ggatttcaaa ccactttggt gtgccaacat ttgtcatgaa ctatccagca 1020 
gccatcaagg ccttctacat gaaaccagtt cctggaaatc cagagcgcgt gctttgtgca 1080 
gacttgcttg ctccagaagg ctatggagaa attatcggtg ggtctatgcg tgaggaagat 1140 
tacgatgccc ttgtcgctaa gatggatgaa cttggcatgg atcgtacaga atatgaattc 1200 
taccttgacc ttcgtaaata cggtacagtt ccacacggag gatttggtat cggtatcgaa 1260 
cgtatggtaa ccttcgcagc aggaacaaaa catatccgtg aagctattcc attcccacgt 1320 
atgttgcacc gtatcaaacc ataa 134 4 

55 <210> 108 
<211> 927 
<212> DNA 
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<213> Streptococcus pneumoniae 



10 



15 



20 



25 



30 



35 



40 



45 



<400> 108 

atgtctgaaa aattagtaga aatcaaagat ttagaaattt ccttcggtga 

aagtttgtcg cggttaaaaa tgctaacttc tttatcaaca agggagaaac 

gtaggtgagt ccggtagtgg gaaaacaact attggtcgtg ctatcatcgg 

acaagtaatg gagatatcat ttttgatggt caaaagatta atggtaagaa 

caagctgcgg aattgattcg tcgaatccag atgattttcc aagaccctgc 

aatgaacgtg cgactgttga ttatattatt tctgaaggtc tttacaatca 

aaggatgaag aagaacgtaa agagaaagtt caaaatatta tccgtgaagt 

gctgagcact tgactcgtta ccctcatgaa ttctcaggcg gtcaacgtca 

attgcccgtg ccttggtcat gcaaccagac tttgttattg cagatgagcc 

ttggacgttt ctgtacgtgc ccaagtcttg aacttgctca aaaaattcca 

ggcctgacct atctcttcat cgcccatgac ttgtcggttg ttcgctttat 

atcgcagtta tttacaaggg tgttattgta gaggttgcag aaacagaaga 

aatccaattc acccatatac tcaagccttg ctttcagcgg taccaatccc 

ttggaacgta agaaggtctt gaaggtttac gacccaagtc aacacgacta 

aagccgtcta tggtagaaat ccgtccaggt cactatgttt gggcgaacca 

gcacgttatc aaaaaggact aaactag 

<210> 109 
<211> 1275 
<212>" DNA 

<213> Streptococcus pneumoniae 
<400> 109 

atgaagataa gttggaatgg attttctaaa aaatcatacc aagagcgcct 
aaagctcagg cgctccttag tcctgagaga caagctagtc tggagaagga 
agtgtgactg tggcagacca gctgagtgag aatgtggtgg gaactttttc 
tcgctggttc cggaggtact tgtcaacggt caggaataca ccgttcccta 
gaaccctctg tggttgcggc ggccagctat gccagcaaaa tcatcaagcg 
tttactgcac aagtccatca gcgccagatg attgggcagg tagcccttta 
aatcctaaac tagcgcaaga gaagattgcc agcaagaaag cggagctctt 
aatcaagcct atccttctat cgttaaacgt gggggtgggg cgcgtgatct 
cagataaaag gcgaaccaga ctttctcgtt gtttatattc atgtcgatac 
atgggtgcca atatgctcaa caccatgctg gaagccttga aaccagtctt 
agtcagggac agagtctcat gggaatcctg tccaactacg cgactgattc 
gcaagctgtc gcatcgcctt tcgctacttg agccgccaaa aggatcaagg 
gcggagaaaa ttgcgttggc tagtcagttt gcgcaggctg atccttaccg 
cataataaag gaatttttaa tggtattgat gcgattttga ttgccactgg 
cgtgccatcg aagctggggc ccatgccttt gccagtcgag atggacgcta 
agctgctgga cgctggacct tgaaagagaa gaattggtcg gtgagatgac 
cctgtagcga ctaagggtgg ctctatcggc ctcaacccac gtgtagctct 
ctactaggaa atccttctgc cagagaatta gcccagatta tcgtgtccat 
caaaattttg cagccctcaa agccttggta agtacgggca tccagcaagg 
ctacaggcca aatccctagc tctcctagct ggggctagtg aatctgaagt 
gtagagcgcc tcatctcaga taaaaccttt aacctagaga cagcccagcg 
aatttaagat cataa ■ 



aggaagtaag 60 
tttctcgctt 120 
tctaaatgat 180 
atcgcgtgaa 240 
cgcaagtttg 300 
ccgtttattt 360 
aggtcttctt 420 
acgtatcggt 480 
aatttcagcc 540 
aaaagagctc 600 
ttcagatcgt 660 
attgtttaac 720 
agatccaatc 780 
tgagactgat 84 0 
aaccgaattg 900 
927 



cgagctgcta 
tgaacagatg 
tctgccttat 
tgtgacagaa 
tgcaggtggt 
tcaaattgct 
ggagcttgcc 
gcatgtcgag 
ccaggaagcc 
agaagaactc 
tctggtgact 
acgagagatt 
agctgctact 
taatgactgg 
tcaaggtctt 
cctgcccatg 
cagtcatgat 
cggtcttgct 
ccacatgaaa 
tgctccccta 
ctatctcgaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1275 



50 <210> 110 
<211> 789 
<212> DNA 

<213> Streptococcus pneumoniae 



55 <400> 110 

atgccaatta catcattaga aataaaggac aagacttttg gaactcgatt cagaggtttt 60 
gatccagaag aagtcgatga atttttagat attgtggttc gtgattacga agatcttgtg 120 



46 



WO 01/49721 



PCT/USOO/35604 



cgtgcgaatc atgataaaaa tttgcgtatt 
gatgaaataa aagattcatt gagccagtct 
gtgaaacagg cggcgcatga acgttcaaac 
caacgcttgt tggaagaagc taaatataag 
5 aatgctaaga aagtcgctgt tgaaacagaa 
caacgtctca aatctacaat tgagagtcag 
gatattctcc gtccaacagc tacttatctt 
gttagcgaag tacttggaga accgattcca 
acacgtcagt tctctcaagc agaaatggca 
10 aaagaattgt ctgaatttga agctcagatt 
gtgagtcctc aagttgaaga agagcctctg 
cagaagtag 

<210> 111 
15 <211> 1728 
<212> DNA 

<213> Streptococcus pneumoniae 



aagagtttag aagagcgttt gtcttacttt 180 
gtattgattg ctcaggatac agctgagaga 24 0 
aatatcattc atcaagcaga gcaagatgcg 300 
gcaaacgaga ttcttcgtca agcaactgat 360 
gaattgaaga acaagagccg tgtcttccac 420 
ttggctattg ttgaatcttc agattgggaa 480 
caaaccagtg atgaagcctt taaagaagtg 54 0 
gctccaattg aagaagaacc aattgatatg 600 
gaattacaag ctcgtattga ggtagccgat 660 
aaacaggaag tggaagctcc aactcctgta 720 
ctcatccagt tggcccaatg tatgaagaac 780 

789 



<400> 111 

20 atgtctaatg gacaactaat ttatttaatg gttgcaattg cagtcatttt agttctggct 60 
tatgtagtgg caatctttct acgtaagcga aacgagggga gattagaggc gctagaagaa 120 
agaaaagaag aactatacaa tcttccagta aatgatgaag tagaagctgt aaaaaatatg 180 
cacttgattg gacaaagtca agtggctttc cgtgaatgga atcaaaaatg ggtcgattta 240 
tctctcaact cttttgccga tattgaaaat aatctctttg aagcagaagg ttataaccat 300 

25 tcatttcgtt ttctcaaggc cagtcatcaa attgaccaaa ttgagagtca aattactttg 360 
attgaagaag atattgcggc aattcgcaat gctttggcag acttagagaa gcaagaatct 420 
aaaaatagtg gtcgtgttct tcatgctttg gatttatttg aggaacttca gcatagagtt 480 
gctgaaaatt cagaacagta tggtcaagcc ttggatgaaa ttgaaaaaca attagaaaat 540 
atccaatctg aattttcaca atttgtaacc ttgaattcat cgggtgaccc tgtggaagcc 600 

30 gcagtgattt tggataatac agaaaatcac attttggcct taagtcatat tgtggatcgt 660 
gttccagcct tggttacgac gctttctaca gaattgccag atcaattaca ggatttggaa 720 
gccggttatc gtaaactaat tgatgctaat tatcattttg ttgaaacgga tattgaagcg 780 
cgtttccact tgctttatga agcattcaag aaaaaccaag agaatattcg tcagttggaa 840 
ttggataatg ccgaatatga gaatggacag gcacaagagg aaatcaatgc cttgtatgat 900 

35 atttttactc gagaaattgc tgctcagaaa gtagtggaaa atctacttgc aactcttcca 960 
acttaccttc aacatatgaa agagaataat actttattgg gagaagatat tgcacgtttg 1020 
aacaagacct atttacttcc tgagacagct gcaagccatg ttcgtcgtat tcagacagaa 1080 
ttagagagtt ttgaggcagc tattgttgag gtaacttcaa atcaagaaga accaacccaa 114 0 
gcttattcag ttcttgaaga aaatcttgag gatttacaaa ctcaactaaa agatattgaa. 1200 

40 gatgagcaaa tttcagttag tgagcgcctg acacaaattg agaaagatga tattaatgca 1260 
cgtcaaaagg ccaatgttta tgtcaatcgt ctccatacta tcaagcgata catggaaaaa 1320 
cgcaatctgc caggtattcc acaaactttc ttgaagttat tctttacggc aagcaataat 1380 
accgaggatt taatggttga gttagaacaa aaaatgatta acattgaatc tgttacccga 1440. 
gttcttgaaa ttgcaacgaa tgatatggaa gctttagaaa cggaaactta taatattgta 1500 

45 caatatgcaa ctttgacaga gcaactcttg caatattcta accgctatcg ctcatttgat 1560 
gaacgcattc aagaagcatt taacgaagct ttagatattt ttgaaaaaga atttgattat 1620 
cacgcttcat ttgataagat ttctcaagca ttggaagtgg cagagcctgg tgtaaccaat 1680 
cgctttgtta cctcatatga gaaaacacgt gaaacgattc gtttttaa 1728 

50 <210> .112 
<211> 2403 
<212> DNA 

<213> Streptococcus pneumoniae 
55 <400> 112 

atgcttatat cttataaatg gttaaaagaa ttggtggaca ttgatgtgcc atcacaagag 60 • 
ttggctgaaa aaatgtcaac tacaggaatc gaggtagagg gtgtcgaatc accagctgct 120 
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ggtctctcaa aaattgtcgt cggtgaggtc 
ctccatgttt gtcaggttaa cgttggcgaa 
aatgtgcgtg ctgggatcaa ggtcatggtg 
tacaaaatca aaaaaggaaa aatccgtggt 
5 ggtgaattgg gaatttctga ctcagttgtg 
ttgcctgaag atgccgtgcc atjgtgaggaa 
atcatcgaac tttccatcac accaaaccgt 
cacgaagtgg cagccatcta tgacaaggca 
actaatgaag ctgcggcaga tgccctttct 

10 tatgcagctc gtatcttgga caatgtgacc 
cttctcatga acgaaggaat ccgtcccatc 
ctgctctatt ttggtcaacc aatgcatgcc 
atccgtgtgc gtgaagcgcg tgctggtgaa 
gacttggacg tgaatgacct agtcatcact 

15 gtcatgggtg gtcaagcaac agaaatctct 
gctgttttca atggcaaatc tatccgtaag 
tcatcttctc gctttgaaaa aggaattaat 
gcagctagcc tgattgcgga acttgcaggt 
ggtgagcttg atacttcaga tgtagaagtt 

20 ctcggaactg agctgtctta tgctgatgta 
ctttctggaa atgcagacag ctttacagtc 
atcgaagctg acctctttga agaaattgct 
agtctaccaa aagacgatgg tacagcaggt 
caagttcgta ctattgctga aggagcaggt 

25 actcctgaaa aagcagttga gtttacggct 
ccaatgactg tggatcgttc agtcctccgt 
gttgcctaca acgtggctcg taagaataaa 
tttgaacaaa caggtaatcc aaaagaagaa 
gccttgacag gcttggttgc tgaaaaagat 

30 ttctatgcta agggaatcct tgaagcccta 
acagcaacat • ctgaaatcgc tagccttcat 
gaccaagttc ttggtttcct tggccaagtg 
ccagaaacgt atgtggctga gcttaacctt 
actccatttg tagaaatcac caaattcccg 

35 aaggcagaag tgactcatca agaagttgta 
ttgacagata tcaaactctt tgacgtcttc 
tcaatggctt atagcttgac cttccaaaat 
gcacgctata tggaaaaaat ccaagcatcg 
taa 

40 

<210> 113 
<211> 543 
<212> DNA 

<213> Streptococcus pneumoniae 

45 

<400> 113 

atgttagaaa acgatattaa aaaagtcctc 
aaaaaactag gtgctcaatt aactaaagat 
attttaaaag gatctattcc ttttatggct 

50 gaaatggac.t tcatgatggt ttctagctac 
aatattaaac aagatgtgac tcaagatatc 
atcattgata caggtcaaac tttgaagaat 
gcttctgtta aaattgcaac cttgttggat 
gcagactata cctgctttac tatcccaaat 

55 aaagaaaatt atcgtaatct tccttatatt 
tag 



ttgtcttgcg aagatgtgcc agagactcac 180 
gaagagcgtc agatcgtttg tggtgcccca 240 
gctcttccag gagctcgtat cgctgataac 300 
ttggagtcac ttggaatgat ctgttcactt 360 
cctaaggaat tcgcagatgg catccaaatc 420 
gtcttttctt acctagactt ggatgatgaa 480 
gcagatgccc tttctatgtg tggagtggct 540 
gtcaacttta aagaatttac tctaacagaa 600 
gtcagcattg agacagacaa ggcgccttac 660 
atcgcaccaa gtccacaatg gttgcaaaac 720 
aataacgtag tggacgtgac caactacatc 780 
tttgacttgg ataactttga agggactgac 840 
aaattggtga ccttggacgg tgaagaacgt 900 
gtcgcagaca agccagtagc ccttgcaggt 960 
gaaaaatcta gtcgtgttgt ccttgaagct 1020 
acaagtggtc gcctgaacct tcgttctgag 1080 
gtggcaacag ttaatgaagc ccttgatgcg 1140 
gcgacggtgc gtaagggcat cgtttcagcg 1200 
tcttcaaccc ttgctgatgt taaccgtgtc 1260 
gaagacgtct tccgtcgtct tggctttggt 1320 
agagtcccac gtcgtcgttg ggatatcaca 1380 
cgtatctatg gttatgaccg cttgccaact 1440 
gaattgacag ccacacaaaa actccgccgt 1500 
ttgacagaaa tcatcaccta tactctaaca 1560 
caaccaagta accttacgga actcatgtgg 1620 
caaaatatga tttcaggtat ccttgatacc 1680 
aacttggccc tttacgagat tggaaaagtc 1740 
cttccaaatg aaatcaacag ttttgccttt 1800 
ttccaaacag cagcagttcc agttgatttc 1860 
tttactcgtt tgggactcca agtaacctat 1920 
ccaggtcgta cagccgtgat ttcactcggt 1980 
catccagtca ctgccaaggc ttacgatatt 2040 
tcagctatcg aagctgcgct tcagccagcg 2100 
gcagtcagcc gtgacgttgc ccttctcctc 2160 
gatgctatcc aagctgccgg cgtgaaacgt 2220 
tcaggtgaga aattgggact tggtatgaag 2280 
ccagaagata gcttaacgga cgaagaagtc 234 0 
ctcgaagaaa aagtcaatgc agaagtgcgt 2400 

2403 



gtttcacacg atgaaattac agaagcagct 60 
tatgcaggaa aaaatccaat cttagttggg 120 
gaattggtca aacatattga tacacatatt 180 
catggtggaa cagcaagtag tggtgttatc 240 
aaaggaagac atgttctatt tgtagaagat 300 
ttgcgagata tgtttaaagc aagagaagca 360 
aaaccagaag gacgtgttgt agaaattgag 420 
gagtttgtag taggttatgg tttagactac 480 
ggagtattga aagaggaagt gtattcaaat 540 

543 
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<210> 114 
<211> 235 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 114 

Met He Tyr Ala Gly He Leu Ala Gly Gly Thr Gly Thr Arg Met Gly 
15 10 15 

He Ser Asn Leu Pro Lys Gin Phe Leu Glu Leu Gly Asp Arg Pro He 
20 25 30 



Leu He His Thr He Glu Lys Phe Val Leu Glu Pro Ser He Glu Lys 
35 40 45 

He Val Val Gly Val His Gly Asp Trp Val Leu His Ala Glu Asp Leu 
50 55 -60 

Val Asp Lys Tyr Leu Pro Leu His Lys Glu Arg He He He Thr Lys 
65 70 75 80 

Gly Gly Ala Asp Arg Asn. Thr Ser He Glu Asn He He Glu Ala He 
85 '90 95 

Asp Ala Tyr Arg Pro Leu Thr Pro Glu Asp He Val Val Thr His Asp 
100 105 110 

Ser Val Arg Pro Phe He Thr .Leu Arg Met He Gin Asp Ser He Lys 
115 120 125 

Leu Ala Gin Asn His Asp Ala Val Asp Thr Val Val Glu Ala Val Asp 
130 135 140 

Thr He Val Glu Ser Thr Asn Gly Gin Phe He Thr Gly He Pro Asn 
145 150 155 160 

Arg Ala His Leu Tyr Gin Gly Gin Thr Pro Gin Thr Phe Arg Cys Lys 
165 -170 175 

Asp Phe Met Asp Leu Tyr Gly Ser Leu Ser Asp Glu Glu Lys Glu He 
180 185 -190 

Leu Thr Asp Ala Cys Lys He Phe Val He Lys Gly Lys Asp Val Ala 
195 200 205 

Leu Ala Lys Gly Glu Tyr Ser Asn Leu Lys He Thr Thr Val Thr Asp 
210 215 220 



Leu Lys He Ala Lys Ser Met He Glu Lys Asp 
225 230 235 



<210> 115 
<211> 185 
<212> PRT 

<213> Streptococcus pneumoniae 
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<400> 115 

Met Ala Asn Val lie lie Glu Lys Ala Lys Glu Arg Met Thr Gin Ser 
1-5 10 15 

His Gin Ser Leu Ala Arg Glu Phe Gly Gly lie Arg Ala Gly Arg Ala 
20 25 30 

Asn Ala Ser Leu Leu Asp Arg Val His Val Glu Tyr Tyr Gly Val Glu 
35 40 45 

Thr Pro Leu Asn Gin lie Ala Ser lie Thr lie Pro Glu Ala Arg Val 
50 55 60 

Leu Leu Val Thr Pro Phe Asp Lys Ser Ser Leu Lys Asp lie Glu Arg 
65 70 75 80 

Ala Leu Asn Ala Ser Asp Leu Gly lie Thr Pro Ala Asn Asp Gly Ser 
85 90 95 

Val lie Arg Leu Val lie Pro Ala Leu Thr Glu Glu Thr Arg Arg Asp 
100 105 110* 

Leu Ala Lys Glu Val Lys Lys Val Gly Glu Asn Ala Lys Val Ala Val 
115 120 125 

Arg Asn lie Arg Arg Asp Ala Met Asp Glu Ala Lys Lys Gin Glu Lys 
130 135 140 

Ala Gin Glu lie Thr Glu Asp Glu Leu Lys Thr Leu Glu Lys Asp lie 
145 150 155 160 

Gin Lys Val Thr Asp Asp Ala Val Lys His He Asp Asp Met Thr Ala 
165 170 175 

Asn Lys Glu Lys Glu Leu Leu Glu Val 
180 185 



<210> 116 
<211> 450 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 116 

Met Gly Lys Tyr Phe Gly Thr Asp Gly Val Arg Gly Glu Ala Asn Leu 
1 5 .10 15 

Glu Leu Thr Pro Glu Leu Ala Phe Lys Leu Gly Arg Phe Gly Gly Tyr 
20 25 30 

Val Leu Ser Gin His Glu Thr Glu Ala Pro Lys Val Phe Val Gly Arg 
35 40 45 

Asp Thr Arg He Ser Gly Glu Met Leu Glu Ser Ala Leu Val Ala Gly 
50 55 60 

Leu Leu Ser Val Gly He His Val Tyr Lys Leu Gly Val Leu Ala Thr 
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65 



70 



75 



80 



Pro Ala Val Ala Tyr Leu Val Glu Thr Glu Gly Ala Ser Ala Gly Val 
85 90 95 

5 

Met lie Ser Ala Ser His Asn Pro Ala Leu Asp Asn Gly lie Lys Phe 
100 105 110 

Phe Gly Gly Asp Gly Phe Lys Leu Asp Asp Glu Lys Glu Ala Glu lie 
10 115 120 125 

Glu Ala Leu Leu Asp Ala Glu Glu Asp Thr Leu Pro Arg Pro Ser Ala 
130 135 140 

15 Glu Gly Leu Gly He Leu Val Asp Tyr Pro Glu Gly Leu Arg Lys Tyr 
145 150 155 160 



20 



Glu Gly Tyr Arg Val Ser Thr Gly Thr Pro Leu Asp Gly Met Lys Val 
165 170 175 

Ala Leu Asp Thr Ala Asn Gly Ala Ala Ser Thr Ser Ala Arg Gin He 
180 185 190 



Phe Ala Asp Leu Gly Ala Gin Leu Thr Val He Gly Glu Thr Pro Asp 
25 195 200 205 

Gly Leu Asn He Asn Leu Asn Val Gly Ser Thr His Pro Glu Ala Leu 
210 215 220 

30 Gin Glu Val Val Lys Glu Ser Gly Ser Ala He Gly Leu Ala Phe Asp 

225 230 235 240 



35 



Gly Asp Ser Asp Arg Leu He Ala Val Asp Glu Asn Gly Asp He Val 
245 250 255 

Asp Gly Asp Lys He Met Tyr He lie Gly Lys Tyr Leu Ser Glu Lys 

260 265 . 270 



Gly Gin Leu Ala Gin Asn Thr He Val Thr Thr Val Met Ser Asn Leu 
40 275 280 285 

Gly Phe His Lys Ala Leu Asn Arg Glu Gly He Asn Lys Ala Val Thr 
290 295 300 " 

45 Ala Val Gly Asp Arg Tyr Val Val Glu Glu Met Arg Lys Ser Gly Tyr 
305 310 315 320 



50 



Asn Leu Gly Gly Glu Gin Ser Gly His - Val He Leu Met Asp Tyr Asn 
325 330 335 

Thr Thr Gly Asp .Gly Gin Leu Ser Ala Val Gin Leu Thr Lys He Met 
340 345 350 



Lys Glu Thr Gly Lys Ser Leu Ser Glu Leu Ala Ala Glu Val Thr He 
55 355 360 365' 



Tyr Pro Gin Lys Leu Val Asn He Arg Val Glu Asn Val Met Lys Glu 
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370 375 380 

Lys Ala Met Glu Val Pro Ala He Lys Ala He He Glu Lys Met Glu 
385 390 395 400 

5 

Glu Glu Met Ala Gly Asn Gly Arg He Leu Val Arg Pro Ser Gly Thr 

405 410 415 . 

Glu Pro Leu Leu Arg Val Met Ala Glu Ala Pro Thr Thr Glu Glu Val 
10 420 425 430 

Asp Tyr Tyr Val Asp Thr He Thr Asp Val Val Arg Ala Glu He Gly 
435 440 445 ' 

15 He Asp 
450 



<210> 117 
20 <211> 234 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 117 

25 Met Lys Lys He Leu He Val Asp Asp Glu Lys Pro He Ser Asp He 
1 5 10 15 

He Lys Phe Asn Met Thr Lys Glu Gly Tyr Glu Val Val Thr Ala Phe 
20 25 30 

30 

Asn Gly Arg Glu Ala Leu Glu Gin Phe Glu Ala Glu Gin Pro Asp He 
35 40 45 

He He Leu Asp Leu Met Leu Pro Glu He Asp Gly Leu Glu Val Ala 
35 50 55 60 

Lys Thr He. Arg Lys Thr Ser Ser Val Pro He Leu Met Leu Ser Ala 
65 "70 75 80 

40 Lys Asp Ser Glu Phe Asp Lys Val He Gly Leu Glu Leu Gly Ala Asp 

85 90 95 

Asp Tyr Val Thr Lys Pro Phe Ser Asn Arg Glu Leu Gin Ala Arg Val 
100 105 110 

45 

Lys Ala Leu Leu Arg Arg Ser Gin Pro Met Pro Val Asp Gly Gin Glu 
115 120 125 

Ala Asp Ser Lys Pro Gin Pro He Gin He Gly Asp Leu Glu He Val 
50 130 135 140 

Pro. Asp Ala Tyr Val Ala Lys Lys Tyr Gly Glu Glu Leu Asp Leu Thr 
145 150 155 160 

55 His Arg Glu Phe Glu Leu Leu Tyr His Leu Ala Ser His Thr Gly Gin 

165 170 175 
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Val He Thr Arg Glu His Leu Leu 
180 

Phe Gly .Asp Val Arg Thr Val Asp 
5 195 ^ 200 

Lys He Glu Asp Thr Pro Ser Arg 
210 215 

10 Gly Val Gly Tyr Tyr Met Arg Asn 
225 230 



Glu Thr Val Trp Gly Tyr Asp Tyr 
185 190 

Val Thr Val Arg Arg Leu Arg Glu 
205 

Pro Glu Tyr He Leu Thr Arg Arg 
220 

Asn Ala 



<210> 118 
15 <211> 368 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 118 

20 Met Glu Glu He Leu Cys He Gly Cys Gly Ala Thr He Gin Thr Thr 
15. 10 15 

Asp Lys Ala Gly Leu Gly Phe Thr Pro Gin Ser Ala Leu Glu Lys Gly 
20 25 30 

25 

Leu Glu Thr Gly Glu Val Tyr Cys Gin Arg Cys Phe Arg Leu Arg His 
35 40 45 

Tyr Asn Glu He Thr Asp Val Gin Leu Thr Asn Asp Asp Phe Leu Lys 
30 50 55 60 

Leu Leu His Glu Val Gly Asp Ser Asp Ala Leu Val Val Asn Val He 
65 70 75 80 

35 Asp He Phe Asp Phe Asn Gly Ser Val He Pro Gly Leu Pro Arg Phe 

85 .90 95 

Val Ser Gly Asn Asp Val Leu Leu Val Gly Asn Lys Lys Asp He Leu 
100 105 110 

40 

Pro Lys Ser Val Lys Ser Gly Lys He Ser Gin Trp Leu Met Lys Arg 
115 120 125 

Ala His Glu Glu Gly Leu Arg Pro Val Asp Val Val Leu Thr Ser Ala 
45 130 135 140 

Gin Asn Lys His Ala lie Lys Glu Val He Asp Lys He Glu His Tyr 
145 150 155 160 

50 Arg Lys Gly Arg Asp Val Tyr Val Val Gly Val Thr Asn Val Gly Lys 

165 . 170 175 

Ser Thr Leu lie Asn Ala lie He Gin Glu He Thr Gly Asp Gin Asn 
180 185 190 

55 

Val He Thr Thr Ser Arg Phe Pro Gly Thr Thr Leu * Asp Lys He Glu 
195 200 205 
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He Pro Leu Asp Asp Gly Ser 
210 215 

5 His Arg His Gin Met Ala His 
225 230 

Val Ser Pro Lys Lys Glu He 
245 

10 

Glu Gin Thr Leu Phe Leu Gly 
260 

Gly Glu Lys Gin Gly Phe Thr 
15 275 



Tyr He Tyr Asp Thr Pro Gly He He 

220 ■ 

Tyr Leu Thr Ala Lys Asn Leu Lys Tyr 
235 240 

Lys Pro Lys Thr Tyr Gin Leu Asn Pro 
250 255 

Gly Leu Gly Arg Phe Asp Phe He Ala 
265 270 

Ala Phe Phe Asp Asn Glu Leu Lys Leu 
280 • 285 



His Arg Ser Lys Leu Glu Gly Ala Ser Ala Phe Tyr Asp Lys His Leu 

290 295 300 

20 Gly Thr Leu Leu Thr Pro Pro Asn Ser Lys Glu Lys Glu Asp Phe Pro 
305 310 315 320 

Arg Leu Val Gin His Val Phe Thr He Lys Asp Lys Thr Asp Leu Val 
325 330 335 



25 



He Ser Gly Leu Gly Trp He Arg Val Thr Gly Thr Ala Lys Val Ala 
340 345 350 



40 



Val Trp Ala Pro Glu Gly Val Ala Val Val Thr Arg Lys Ala He He 
30 355 360 365 



35 

<210> 119 
<211> 486 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 119 

Met Tyr Pro Asp Asp Ser Leu Thr Leu His Thr Asp Leu Tyr Gin He 
1 5 10 15 

45 Asn Met Met Gin Val Tyr Phe Asp Gin Gly He His Asn Lys Lys Ala 
20 - 25 30 

Val Phe Glu Val Tyr Phe Arg Gin Gin Pro Phe Lys Asn Gly Tyr Ala 
,35- 40 45 

50 

Val Phe Ala Gly Leu Glu Arg He Val Asn Tyr Leu Glu Asp Leu Arg 
50 55 60 

Phe Ser Asp Ser Asp He Ala Tyr Leu Glu Ser Leu Gly Tyr His Gly 
55 65 70 75 80 

Ala Phe Leu Asp Tyr Leu Arg Asn Phe Lys Leu Glu Leu Thr Val Arg . 
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85 

Ser Ala Gin Glu Gly Asp Leu Val 
100 

5 

Val Glu Gly Pro Leu Ala Gin Cys 
115 120 



90 95 . 

Phe Ala Asn Glu Pro lie Val Gin 
105 110 

Gin Leu Val Glu Thr Ala Leu Leu 
125 



Asn lie Val Asn Tyr Gin Thr Leu Val Ala Thr Lys Ala Ala Arg lie 
.10 130 135 140 

Arg Ser Val He Glu Asp Glu Pro Leu Met Glu Phe Gly Thr Arg Arg 
145 150 155 160 

15 Ala Gin Glu Thr Asp Ala Ala He Trp Gly Thr Arg Ala Ala Val He 

165 170 175 

Gly Gly Ala Asn Gly Thr Ser Asn Val Arg Ala Gly Lys Leu Phe Asp 
180 185 190 

20 

He Pro Val Leu Gly Thr His Ala His Ala Leu Val Gin Val Tyr Gly 
195 200 205 

Asn Asp Tyr Glu Ala Phe Lys Ala Tyr Ala Ala Thr His Lys Asn Cys 
25 210 215 220 



Val Phe Leu Val Asp Thr Tyr Asp Thr Leu Arg He Gly Val Pro Ala 

225 230 235 240 

30 Ala He Gin Val Ala Arg Glu Leu Gly Asp Gin He Asn Phe Met Gly 

245 250 255 



35 



Val Arg He Asp Ser Gly Asp He Ala Tyr He Ser Lys Lys Val Arg 
260 265 270 

Gin Gin Leu Asp Glu Ala Gly Phe Thr Glu Ala Lys He Tyr Ala Ser 
275 280 285 



Asn Asp Leu Asp Glu Asn Thr He Leu Asn Leu Lys Met Gin Lys Ala 
40 290 295 300 

Lys He Asp Val Trp Gly Val Gly Thr Gin Leu He Thr Ala Tyr Asp 
305 310 315 320 

45 Gin Pro Ala Leu Gly Ala Val Tyr Lys He Val Ala He Glu Asp Glu 

325 330 335 



50 



Thr Gly Gin Met Arg Asn Thr He Lys - Leu Ser Asn Asn Ala Glu Lys 
340 345 350 

Val Ser Thr Pro Gly Lys Lys Gin Val Trp Arg He Thr Ser Arg Glu 
355 360 365 



Lys- Gly Lys Ser Glu Gly Asp Tyr He Thr Tyr Asp Gly Val Asp lie 
55 370 375 380 



Ser Asp Met Thr Glu lie Lys Met Phe His Pro Thr Tyr Thr Tyr He 
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385 



390 



395 



400 



Lys 



Lys 



Thr Val • Arg Asn 
405 



Phe Asp Ala Val Pro Leu Leu 
410 



Val - Asp He 
415 



Phe Lys Glu Gly He Leu Val Tyr Asn Leu Pro Ser Leu Thr Asp He 
420 425 430 

Gin Asp Tyr Ala Arg Lys Glu Phe Asp Lys Leu Trp Asp Glu Tyr Lys 
435 440 445 

Arg Val Leu Asn Pro Gin His Tyr Pro Val Asp Leu Ala Arg Asp Val 
450 455 460 

Trp Gin Asp Lys Met Asp Leu He Asp Lys Met Arg Lys Glu Ala Leu 
465 470 475 480 

Gly Glu Gly Glu Glu Glu 



<210> 120 
<211> 283 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 120 

Met Ala Thr He Gin Trp Phe Pro Gly His Met Ser Lys Ala Arg Arg 
1 5 10 15 

Gin Val Gin Glu Asn Leu Lys Phe Val Asp Phe Val Thr He Leu Val 
20 25 30 

Asp Ala Arg Leu Pro Leu Ser Ser Gin Asn Pro Met Leu Thr Lys He 
35 40 45 

Val Gly Asp Lys Pro Lys Leu Leu He Leu Asn Lys Ala Asp Leu Ala 
50 55 60 

Asp Pro Ala Met Thr Lys Glu Trp Arg Gin Tyr Phe Glu Ser Gin Gly 
65 70 75 80 

He Gin Thr Leu Ala He Asn Ser Lys Glu Gin Val Thr Val Lys Val 
85 '90 95 

Val Thr Asp Ala Ala Lys Lys Leu Met Ala Asp Lys He Ala Arg Gin 
100 105 110 

Lys Glu Arg Gly He Gin He Glu Thr Leu Arg Thr Met He He Gly 
115 120 . 125 

He Pro Asn Ala Gly Lys Ser Thr Leu Met Asn Arg Leu Ala Gly Lys 
130 135 140 



485 



Lys He Ala Val Val Gly Asn Lys Pro Gly Val Thr Lys Gly Gin Gin 
145 150 155 160 



WO 01/49721 



PCT/US00/35604 



Trp Leu Lys Thr Asn Lys Asp Leu Glu lie Leu Asp Thr Pro Gly lie 
165 170 175 

Leu Trp Pro Lys Phe Glu Asp Glu Thr Val Ala Leu Lys Leu Ala Leu 
180 185 190 

Thr Gly Ala lie Lys Asp Gin Leu Leu Pro Met Asp Glu Val Thr lie 
195 200 205 

Phe Gly lie Asn Tyr Phe Lys Glu His Tyr Pro Glu Lys Leu Ala Glu 
210 215 220 

Arg Phe Lys Gin Met Lys lie Glu Glu Glu Pro Ser Val He He- Met 
225 230 . 235 240 

Asp Met Thr Arg Ala Leu Gly Phe Arg Asp Asp Tyr Asp Arg Phe Tyr 
245 .250 255. 

Ser Leu Phe Val Lys Glu Val Arg Asp Gly Lys Leu Gly Asn Tyr Thr 
260 * 265 270 

Leu Asp Thr Leu Glu Asp Leu Asp Gly Asn Asp 
275 '280 



<210> 121 
<211> 156 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 121 

Met He Asn Asn Val. Val Leu Val Gly Arg Met Thr Arg Asp Ala Glu 
1 " 5 . 10 15 

Leu Arg Tyr Thr Pro Ser Asn Val Ala Val Ala Thr Phe Thr Leu Ala 
20 25 30 

Val Asn Arg Thr Phe Lys Ser Gin Asn Gly Glu Arg Glu Ala Asp Phe 
' 35 40 45 

He Asn Val Val Met Trp Arg Gin Gin Ala Glu Asn Leu Ala Asn Trp 
50 55 60 

Ala Lys Lys Gly Ser Leu He Gly Val Thr Gly Arg He Gin Thr Arg 
65 70 75 * 80 

Ser Tyr Asp Asn Gin Gin Gly Gin Arg Val Tyr Val Thr Glu Val Val 
85 " 90 95 

Ala Glu Asn. Phe Gin Met Leu Glu. Ser Arg Ser Val Arg Glu Gly His 
100 105 110" 

Thr Gly Gly Ala Tyr Ser Ala Pro Thr Ala Asn Tyr Ser Ala Pro Thr 
115 120 125 

Asn Ser Val Pro Asp Phe Ser Arg Asn Glu Asn Pro Phe Gly Ala Thr 
130 135 140 
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Asn Pro Leu Asp He Ser Asp Asp Asp Leu Pro Phe 
145 150 155 

5 

<210> 122 
<211> 324 
<212> PRT 

<213> Streptococcus pneumoniae 

10 

<400> 122 

Met Lys Thr Arg He Thr Glu Leu Leu Lys He Asp Tyr Pro He Phe 
15 10 15 

15 Gin Gly Gly Met Ala Trp Val Ala Asp Gly Asp Leu Ala Gly Ala Val 
20 25 30 

Ser Lys Ala Gly Gly Leu Gly He He Gly Gly Gly Asn Ala Pro Lys 
35 40 45 

20 

Glu Val Val Lys Ala Asn He Asp Lys lie Lys Ser Leu Thr Asp Lys 
50 55 60 

Pro Phe Gly Val "Asn He Met Leu Leu Ser Pro Phe Val Glu Asp He 
25 65 70 75 80 

Val Asp Leu Val He Glu Glu Gly Val Lys Val Val Thr Thr Gly Ala 
85 90 95 

30 Gly Asn Pro Ser Lys Tyr Met Glu Arg Phe His Glu Ala Gly He He 
100 105 110 

Val He Pro Val Val Pro Ser Val Ala Leu Ala Lys Arg Met Glu Lys 
115 . 120 125 

35 

He Gly Ala Asp Ala Val lie Ala Glu Gly Met Glu Ala Gly Gly His 

130 135 140 

He Gly Lys Leu Thr Thr Met Thr Leu Val Arg Gin Val Ala Thr Ala 
40 145 . 150 155 . 160 

He Ser He Pro Val He Ala Ala Gly Gly lie Ala Asp Gly Glu Gly 
165 170 175 

45 Ala Ala Ala Gly Phe Met Leu Gly Ala Glu Ala Val Gin Val Gly Thr 
180 185 190 

Arg Phe Val. Val- Ala Lys Glu Ser Asn Ala His Pro Asn Tyr Lys Glu 
195 200 205 

50 

Lys He Leu Lys Ala Arg Asp lie Asp Thr Thr He Ser Ala Gin His 
210 215 220 

Phe Gly His Ala Val Arg Ala He Lys Asn Gin Leu Thr Arg Asp Phe 
55 225 230 235 240 

Glu Leu Ala Glu Lys Asp Ala Phe Lys Gin Glu Asp Pro Asp Leu Glu 
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245 250 2'55 

lie Phe Glu Gin Met Gly Ala Gly Ala Leu Ala Lys Ala Val Val His 
260 265 270 

Gly Asp Val Glu Gly Gly Ser Val Met Ala Gly Gin He Ala Gly Leu 
275 280 285 

Val Ser Lys Glu Glu Thr Ala Glu- Glu He Leu Lys Asp Leu Tyr Tyr 
290 295 300 

Gly Ala Ala Lys Lys He Gin Glu Glu Ala Ser Arg Trp Thr Gly Val 
305 * 310 315 320 

Val Arg Asn Asp 



<210> 123 
<211> 140 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 123 

Met He Asp He Gin Gly He Lys Glu Ala Leu Pro His Arg Tyr Pro 
1 5 10 15 

Met Leu Leu Val Asp Arg Val Leu Glu Val Ser Glu Asp Thr He Val 
20 25 30 

Ala He Lys Asn Val Thr He Asn Glu Pro Phe Phe Asn Gly His Phe 
■ 35 40 45 

Pro Gin Tyr Pro Val Met Pro Gly Val Leu He Met Glu Ala Leu Ala 
50 ■ * 55 60 

Gin Thr Ala Gly Val Leu Glu Leu Ser Lys Pro Glu Asn Lys Gly Lys 
65 -70 75 80 

Leu Val Phe Tyr Ala Gly Met Asp Lys Val Lys Phe Lys Lys Gin Val 
85 90 95 

Val Pro Gly Asp Gin Leu Val Met Thr Ala Thr Phe Val Lys Arg Arg 
100 105 110 

Gly Thr He Ala Val Val Giu Ala Lys Ala Glu Val Asp Gly Lys Leu 
115 120 125 

Ala Ala Ser Gly Thr Leu Thr Phe Ala He Gly Asn 
130 135 140 



<210> 124 
<211> 340 
<212> PRT 

<213> Streptococcus pneumoniae 
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10 



<400> 124 

Met He Asn Gin He Tyr Gin Leu Thr Lys Pro Lys Phe He Asn Val 
1 5 10 .15 

Lys Tyr Gin Glu Glu Ala He Asp Gin Glu Asn His He Leu He Arg 
20 25 30 

Pro Asn Tyr Met Ala Val Cys His Ala Asp Gin Arg Tyr Tyr Gin Gly 
35 40 45 

Lys Arg Asp Pro Lys He Leu Asn Lys Lys Leu Pro Met Ala Met He 
50 55 60 



His Glu Ser Cys Gly He Val He Ser Asp Pro Ser Gly Thr Tyr Glu 
15 65 70 75 80 

Val Gly Gin Lys Val Val Met He Pro Asn Gin Ser Pro Met Gin Ser 
85 90 95 

20 Asp Glu Glu Phe Tyr Glu Asn Tyr Met Thr Gly Thr His Phe Leu Ser 
100 105 110 



25 



Ser Gly Phe Asp Gly Phe Met Arg Glu Phe Val Ser Leu Pro Lys Asp 

115 120 125 

Arg Val Val Ala Tyr Asp Ala He Glu Asp Thr Val Ala Ala He Thr 

130 135 140 



Glu Phe Val Ser Val Gly Met His Ala Met Asn Arg Leu Leu Thr Leu 
30 145 150 155 160 

Ala His Ser Lys Arg Glu Arg He Pro Val He Gly Asp Gly Ser Leu 
165 170 175 

35 Ala Phe Val Val Ala Asn He He Asn Tyr Thr Leu Pro Glu Ala Glu 
180 185 190 



40 



He Val Val He Gly Arg His Trp Glu Lys Leu Glu Leu Phe Ser Phe 
195 200 205 

Ala Lys Glu Cys Tyr He Thr Asp Asn He Pro Glu Glu Leu. Ala Phe 
210 215 220 



Asp His Ala Phe Glu Cys Cys Gly Gly Asp Gly Thr Gly Pro Ala He 
45 225 230 235 240 

Asn Asp Leu He Arg Tyr He Arg Pro Gin Gly Thr He Leu Met Met 
245 -250 255 

50 Gly Val Ser Glu Tyr Lys Val Asn Leu Asn Thr Arg Asp Ala Leu Glu 
260 265 270 



55 



Lys Gly Leu Leu Leu Val Gly Ser Ser Arg Ser Gly Arg He Asp Phe 
275 280 285 

Glu Asn Ala He Gin Met Met Lys Val Lys Lys Phe Ala Asn Arg Leu 
290 295 300 
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Lys Asn He Leu Tyr Leu Glu Glu Pro Val Arg Glu He Lys Asp He 

305 310 315 320 

5 His Arg Val Phe Ala Thr Asp Leu Asn Thr Ala Phe Lys Thr Val Phe 

325 330 335 



10 



20 



- 35 



50 



Lys Trp Glu Val 
340 



<210> 125 

<211> 447 

<212> PRT 

15 <213> Streptococcus pneumoniae 



<400> 125 

Met Asn Leu Lys Thr Thr Leu Gly Leu Leu Ala Gly Arg Ser Ser His 
15 10 15 

Phe Val Leu Ser Arg Leu Gly Arg Gly Ser Thr Leu" Pro Gly Lys Val 
20 25 30 



Ala Leu Gin Phe Asp Lys Asp He Leu Gin Asn Leu Ala Lys Asn Tyr 
25 35 ' 40 45 

Glu He Val Val Val Thr Gly Thr Asn Gly Lys Thr Leu Thr Thr Ala 
50 55 60 

30 Leu Thr Val Gly He Leu Lys Glu Val Tyr Gly Gin Val Leu Thr Asn 
65 70 75 80 



Pro Ser Gly Ala Asn Met He Thr Gly lie Ala Thr Thr Phe Leu Thr 

85 90 95 

Ala Lys Ser Ser Lys Thr Gly Lys Asn He Ala Val Leu Glu He Asp 
100 105 110 



Glu Ala Ser Leu Ser Arg He Cys Asp Tyr He Gin Pro Ser Leu Phe 
40 115 120 125 

Val He Thr Asn He Phe Arg Asp Gin Met Asp Arg Phe Gly Glu He 
130 135 140 

45 Tyr Thr Thr Tyr Asn Met He Leu Asp Ala He Arg Lys Val Pro Thr 
145 150 155 160 



Ala Thr Val Leu Leu Asn Gly Asp Ser~. Pro Leu Phe Tyr Lys Pro Thr 
165 170 175 

He Pro Asn Pro He Glu Tyr Phe Gly Phe Asp Leu Glu Lys Gly Pro 
180 185 190 



Ala Gin Leu Ala His Tyr Asn Thr Glu Gly He Leu Cys Pro Asp Cys 
55 195 200. 205 

Gin Gly He Leu Lys Tyr Glu His Asn Thr Tyr Ala Asn Leu Gly Ala 
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210 215 220 

Tyr lie Cys Glu Gly Cys Gly Cys Lys Arg. Pro Asp Leu Asp Tyr Arg 
225 230 235 240 

Leu Thr Lys Leu Val Glu Leu Thr Asn Asn Arg Ser Arg Phe Val lie 
245 2*50 255 

Asp Gly Gin Glu Tyr Gly lie Gin lie Gly Gly Leu Tyr Asn He Tyr 
260 265 270 

Asn Ala Leu Ala Ala Val Ala He Ala Arg Phe Leu Gly Ala Asp Ser 
275 280 285 

Gin Leu He Lys Gin Gly Phe Asp Lys Ser Arg Ala Val Phe Gly Arg 
290 295 300 

Gin Glu Thr Phe His He Gly Asp Lys Glu Cys Thr Leu Val Leu He 
305 310 315 320 

Lys Asn Pro Val Gly Ala Thr Gin Ala He Glu Met He Lys Leu Ala 
325 330 335 

Pro Tyr Pro Phe Ser Leu Ser Val Leu Leu Asn Ala Asn Tyr Ala Asp 
340 345 350 

Gly He Asp Thr Ser Trp He Trp Asp Ala Asp Phe Glu Gin lie Thr 
355 360 365 

Asp Met Asp He Pro Glu He Asn Ala Gly Gly Val Arg His Ser Glu 
370 375 380 

He Ala Arg Arg Leu Arg Val Thr Gly Tyr Pro Ala Glu Lys He Thr 
385 390 395 400 

Glu Thr Ser Asn Leu Glu Gin Val Leu Lys Thr He Glu Asn Gin Asp 
405 410 415 

Cys Lys His Ala Tyr He Leu Ala Thr Tyr Thr Ala Met Leu Glu Phe 
420 425 430 

Arg Glu Leu Leu Ala Ser Arg Gin He Val Arg Lys Glu Met Asn 
435 440 445 



<210> 126 
<211> 260 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 126 

Met Val Tyr Thr Ser Leu Ser Ser Lys Asp Gly Asn Tyr Pro Tyr Gin 
15 10 15 

Leu Asn He Ala His Leu Tyr Gly Asn .Leu Met Asn Thr Tyr Gly Asp 
20 25 30 
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Asn Gly Asn He Leu Met Leu Lys Tyr Val Ala Glu Lys Leu Gly Ala 
35 40 45 

His Val Thr Val Asp He Val Ser Leu His Asp Asp Phe Asp Glu Asn 
5 50 55 60 

His Tyr Asp He Ala Phe Phe Gly Gly Gly Gin Asp Phe Glu Gin Ser 
65 .70 75 80 

10 lie He Ala Asp Asp Leu Pro Ala Lys Lys Glu Ser He Asp Asn Tyr 

85 90 95 

He Gin Asn Asp Gly Val Val Leu Ala He Cys Gly Gly Phe Gin Leu 
100 105 110 

15 

Leu Gly Gin Tyr Tyr Val Glu Ala Ser Gly Lys Arg He Glu Gly Leu 
115 120 125 

Gly Val Met Gly His Tyr Thr Leu Asn Gin Thr Asn Asn Arg Phe He 
20 . 130 135 140 

Gly Asp He Lys He His Asn Glu Asp Phe Asp Glu Thr Tyr Tyr Gly 
145 150 155 160 

25 Phe Glu Asn His Gin Gly Arg Thr Phe Leu Ser Asp Asp Gin Lys Pro 

165 170 175- 

Leu Gly Gin Val Val Tyr Gly Asn Gly Asn Asn Glu Glu Lys Val Gly 
180 185 190 

30 

Glu Gly Val His Tyr. Lys Asn Val Phe Gly Ser Tyr Phe His Gly Pro 
195 200 205 

He Leu Ser Arg Asn Ala Asn Leu Ala Tyr Arg Leu Val Thr Thr Ala 
35 210 215 220 

Leu Lys Lys Lys Tyr Gly Gin Asp He Gin Leu Pro Ala Tyr Glu Asp 
225 230 235 240 

40 -lie Leu Ser Gin Glu He Ala Glu Glu Tyr Ser Asp Val Lys Ser Lys 

245 . 250 255 

Ala Asp Phe Ser 
260 

45 

<210> 127 
<211> 223 
<212> PRT 
50 <213> Streptococcus pneumoniae 

<400> 127 

Met Asn Val Lys Glu Asn Thr Glu Leu Val Phe Arg Glu Val Ala Glu 
15 10 15 

Ala Ser Leu Ser Ala Asn Arg Glu Ser Gly Ser Val Ser Val lie Ala 
20 25 30 
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Val Thr Lys Tyr Val Asp Val Pro Thr Ala Glu Ala Leu Leu Pro Leu 
35 40 45' 

5 Gly Val His His lie Gly Glu Asn Arg Val Asp Lys Phe Leu Glu Lys 
50 55 60 



10 



Tyr Glu Ala Leu Lys Asp Arg Asp Val Thr Trp His Leu lie Gly Thr 
65 70 75 80 

Leu Gin Arg Arg Lys Val Lys Asp Val lie Gin Tyr Val Asp Tyr Phe 
85 90 95 



His Ala Leu Asp Ser Val Lys Leu Ala Gly Glu lie Gin Lys Arg Ser 
15 100 105 110 

Asp Arg Val lie Lys Cys Phe Leu Gin Val Asn lie Ser Lys Glu Glu 
115 120 ' 125 

20 Ser Lys His Gly Phe Ser Arg Glu Glu Leu Leu Glu lie Leu Pro Glu 
130 135 140 



25 



Leu Ala Gly Leu Asp Lys lie Glu Tyr Val Gly Leu Met Thr Met Ala 
145 150 155 160 

Pro Phe Glu Ala Ser Ser Glu Gin Leu Lys Glu lie Phe Lys Ala Ala 
165 170 175 



Gin Asp Leu Gin Arg Glu lie Gin Glu Lys Gin lie Pro Asn lie Pro 
30 180 .185 190 



Met Thr Glu Leu Ser Met Gly Met Ser Arg Asp Tyr Lys Glu Ala lie 
195 ' 200 205 

35 Gin Phe Gly Ser Thr Phe Val' Arg lie Gly Thr Ser Phe Phe Lys 
210 " 215 220 



<210> 128 

40 <211> 279 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 128 

45 Met Gly He Ala Leu Glu Asn Val Asn Phe Thr Tyr Gin Glu Gly Thr 
1 5 10 15 

Pro Leu Ala Ser Ala Ala Leu Ser Asp Val Ser Leu Thr He Glu Asp 
* 20 25 30 

50 

Gly Ser Tyr Thr Ala Leu He Gly His Thr Gly Ser Gly Lys Ser Thr 
35 40 45 

He Leu Gin Leu Leu Asn Gly Leu Leu Val Pro Ser Gin Gly Ser Val 
55 50- 55 60 

Arg Val Phe Asp Thr Leu lie Thr Ser Thr Ser Lys Asn Lys Asp He 
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65 



70 



75 



80 



Arg Gin He Arg Lys Gin Val Gly Leu Val Phe Gin Phe Ala Glu Asn 
85 90 95 

5 

Gin He Phe Glu Glu Thr Val Leu Lys Asp Val Ala Phe Gly Pro Gin 
100 105 110 

Asn Phe Gly Val Ser Glu Glu Asp Ala Val Lys Thr Ala Arg Glu Lys 
10 115 120 125 

Leu Ala Leu Val Gly He Asp Glu Ser Leu Phe Asp Arg Ser Pro Phe 
130 135 140 

15 ■ Glu Leu Ser Gly Gly Gin Met Arg Arg Val Ala He Ala Gly He Leu 
145 150 155 160 



20 



Ala Met Glu Pro Ala He Leu Val Leu Asp Glu Pro Thr Ala Gly Leu 
165 170 175 

Asp Pro. Leu Gly Arg Lys Glu Leu Met Thr Leu Phe Lys Lys Leu His 
180 185 190 



Gin Ser Gly Met Thr He Val Leu Val Thr His Leu Met Asp Asp Val 
25 195 200 - 205 

Ala Glu Tyr Ala Asn Gin Val Tyr Val Met Glu Lys Gly Arg Leu Val 
210 215 220 

30 Lys Gly Gly Lys Pro Ser Asp Val Phe Gin Asp Val Val Phe Met Glu 
225 . 230 235 240 



35 



Glu Val Gin Leu Gly Val Pro Lys He Thr Ala Phe Cys Lys Arg Leu 

245 250 255 

Ala Asp Arg Gly Val Ser Phe Lys Arg Leu Pro Val Lys lie Glu Glu 

260 . . 265 270 



Phe Lys Glu Ser Leu Asn Gly 
40 275 



<210> 129 
<211> 309 
45 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 129 

Met Asp He Gin Phe Leu Gly Thr Gly Ala Gly Gin Pro Ser Lys Ala 
50 1 5 10 ■ 15 

Arg Asn Val Ser Ser Leu Ala Leu Lys Leu Leu Asp Glu He Asn Glu 
20 . 25 30 

55 Val Trp Leu Phe Asp Cys Gly Glu Gly Thr Gin Asn Arg He Leu Glu 
35 40 45 
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Thr Thr lie Arg Pro Arg Lys Val Ser Lys lie Phe lie Thr His Leu 
50 55 60 

His Gly Asp His lie Phe Gly Leu Pro Gly Phe Leu Ser Ser Arg Ala 
5 65 70 75 80 

Phe Gin Ala Asn Glu Glu Gin Thr Asp Leu Glu lie Tyr <31y Pro Gin 
85 90 95 

10 Gly lie Lys Ser Phe Val Leu Thr Ser Leu Arg Val Ser Gly Ser Arg 
100 105 110 

Leu Pro Tyr Arg lie His Phe His Glu Phe Asp Gin Asp Ser Leu Gly 
115 120 125 

15 

Lys lie Leu Glu He Asp Lys Phe Thr Val Tyr Ala Glu Glu Leu Asp 
130 135 140 

His Thr He Phe Cys Val Gly Tyr Arg Val Met Gin Lys Asp Leu Glu 
20 145 150 155 160 

.Gly Thr Leu Asp Ala Glu Lys Leu Lys Ala Ala Gly Val Pro Phe Gly 
165 170 175 

25 Pro Leu Phe Gly Lys He .Lys Asn Gly Gin Asp Leu Val Leu Glu Asp 
180 185 190 

Gly Thr Glu He Lys Ala Ala Asp Tyr He Ser Ala Pro Arg Pro Gly 
195 200 205 

30 

Lys He lie Thr He Leu Gly Asp Thr Arg Lys Thr Asp Ala Ser Val 
210 215 220 

Arg Leu Ala Val Asn Ala Asp Val Leu Val His Glu Ser Thr Tyr Gly 
35 225 230 ' 235 240 

Lys Gly Asp Glu Lys He Ala Arg Asn His Gly His Ser Thr Asn Met 
245 250 255 

40 Gin Ala Ala Gin Val Ala Val Glu Ala Gly Ala Lys Arg Leu Leu Leu 
260 265 270 

Asn His He Ser Ala Arg Phe Leu Ser Lys Asp He Ser Lys Leu Lys 
275 280 285 

45 

Lys Asp Ala Ala Thr He Phe Glu Asn Val His Val Val Lys Asp Leu 
290 295 _ 300 

Glu Glu Val Glu He 
50 305 



<210> 130 
<211> 553 
55 <212> PRT 

<213> Streptococcus pneumoniae 
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<400> 130 
Met Ser Asn lie 
1 

Lys Asn Met Tyr 
20 

Val Gly Leu Lys 
35 

lie Pro Asn Met 
50 

Val Phe Leu Thr 
65 

Leu Leu Ala Glu 



Glu Leu Ala Lys 
100 

Asn Asp Phe His 
115 

Thr Val Val Ser 
130 

Gly He Val Leu 
145 

Phe Lys Phe Asp 



Arg Leu Ala Glu 
180 

Ser Ala Asn Ala 
195 

Arg Asp Glu He 
210 

Val Ala Ala Val 
225 

Ala Ala Asp Lys 



Glu Asn He Val 
260 

Asn Glu He Leu 

275 . 

His Glu Leu He 
290 



Ser Leu Thr Thr 
5 

lie Ala Glu He 



Tyr Pro Glu Asn 
40 

Asp Tyr Leu Phe 
55 

His Gly His Ala 
70 

Ala Lys Val Pro 
85 

Leu Phe Val Lys 



Val He Asp Glu 
120 

Phe Phe Pro Thr 
135 

Lys Thr Ser Glu 
150 

Gin Thr Ala Ser 
165 

He Gly Arg Asp 



Asp Ser Asn He 
200 

Thr Gin Thr He 
215 

Ser Ser Asn Leu 
230 

Thr Gly Arg Arg 
245 

Arg Thr Ala He 



Leu He Lys Pro 
280 

He Leu Glu Thr 
295 



Leu Gly Gly Val 
10 

Gly Glu Ser He 
25 

Glu Gin Leu Gly 



Glu Asn Ser Asp 
60 

Asp Ala He Gly 
75 

Val- Phe Gly Ser 
90 

Gly Asn Asp Ala 
105 

Asn Thr Glu He 



Thr Tyr Ser Val 
140 

Gly Ser He Val 
155 

Glu Ser Tyr Ala 
170 

Gly Val Leu Ala 
185 

Gin Val Ala Ser 



Ala Asp Trp Glu 
220 

Ser Arg He Gin 
235 

He Val Leu Thr 
'250 

Arg Leu Lys Lys 
265 

Lys Asp Met Ser 



Gly Arg Met Gly 
300 



Arg Glu Asn Gly 
15 

Phe Val Leu Asn 
30 

Val Asp Val Val 
45 

Arg lie Ala Gly 



Ala Leu Pro Tyr 
80 

Glu Leu Thr He 
95 

Val Lys Lys Phe 
110 

Asp Phe Gly Gly 
125 

Pro Glu Ser Leu 



Tyr Thr Gly Asp 
160 

Thr Asp Phe Ala 
175 

Leu Leu Ser Asp 
190 

Glu Ser Glu Val 
205 

Gly Arg He He 



Gin He. Phe Asp 
240 

Gly Phe Asp He 
255 

Leu Ser Leu Ala 
270 

Arg Phe Glu Asp 
285 

Glu Pro He Asn 
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Gly Leu Arg Lys Met Ser lie Gly Arg His Arg Tyr Val Glu lie Lys 
305 310 315 320 

• • 5 Asp Gly Asp Leu Val Tyr lie Ala Thr Ala Pro Ser He Ala Lys Glu 

325 330 ' 335 

Ala Phe Val Ala Arg Val Glu Asn Met He* Tyr Gin Ala Gly Gly Val 
340 345 350 

10 

Val Lys Leu He Thr Gin Ser Leu His Val Ser Gly His Gly Asn Val 
355 360 365 

Arg Asp Leu Glri Leu Met He Asn Leu Leu Gin Pro Lys Tyr Leu Phe 
15 370 375 380 

Pro Val Gin Gly Glu Tyr Arg Glu Leu Asp Ala His Ala Lys Ala Ala 
385 390 395 400 

20 Met Ala Val Gly Met Leu Pro Glu Arg He Phe He Pro Lys Lys Gly 

405 410 415 

Thr Thr Met Ala Tyr Glu Asn Gly Asp Phe Val Pro Ala Gly Ser Val 
420 425 430 

25 ' 
Ser Ala Gly Asp He Leu lie Asp Gly Asn Ala He Gly Asp Val Gly 
435 - 440 445 

Asn Val Val Leu Arg Asp Arg Lys Val Leu Ser Glu Asp Gly He Phe 
30 450 455 460 

He Val Ala He Thr Val Asn Arg Arg Glu Lys Lys He Val Ala Arg 
465 470 . 475 480 

35 Ala Arg Val His Thr Arg Gly Phe Val Tyr Leu Lys Lys Ser Arg Asp 

485 490 495 

He Leu Arg Glu Ser Ser Glu Leu He Asn Gin Thr Val Glu Asp Tyr 
500 505 510 

40 

Leu Gin Gly Asp Asp Phe Asp Trp Ala Asp Leu Lys Gly Lys Val Arg 
515 520 525 

Asp Asn Leu Thr Lys Tyr Leu Phe Asp Gin Thr Lys Arg Arg Pro -Ala 
.45 530 535- 540 

He Leu Pro Val Val Met Glu Ala Lys 
545 550 

50 

<210> 131 
<211> 316 
. <212> PRT 
<213> Streptococcus pneumoniae 

55 

<400> 131 

Met Thr Lys Glu Phe His His Val Thr Val Leu Leu His Glu Thr He 
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1 5 10 15 

Asp Met Leu Asp Val Lys Pro Asp Gly He Tyr Val Asp Ala Thr Leu 
20 25 30 

Gly Gly Ala Gly His Ser Glu Tyr Leu Leu Ser Lys Leu Ser Glu Lys 
35 40 45 

Gly His Leu Tyr Ala Phe Asp Gin Asp Gin Asn Ala He Asp Asn Ala 
50 55 60 



Gin .Lys Arg Leu Ala Pro Tyr He 
65 70 

Lys Asp Asn Phe Arg His Leu Gin 
85 

Gin Glu He Asp Gly He Cys Tyr 
100 

Leu Asp Gin Arg Glu Arg Gly Phe 
115 120 

* Asp Met Arg Met Asn Gin Asp Ala 
130 135 



Glu Lys Gly Val Val Thr Phe He 
75 80 

Ala Arg Leu Arg Glu Ala Gly Val 
90 95 

Asp Leu Gly Val .Ser Ser Pro Gin 
105 110 

Ser Tyr Lys Lys Asp Ala Pro Leu 
125 

Ser Leu Thr Ala Tyr Glu Val Val 
140 



Asn His Tyr Asp Tyr His Asp Leu 
145 150 

Glu Asp Lys Phe Ser Lys Gin He 
165 

Glu Val Lys Pro He Glu Thr Thr 
180 



Val Arg He Phe Phe Lys Tyr Gly 
155 . 160 

Ala Arg Lys He Glu Gin Ala Arg 
170 175 

Thr Glu Leu Ala Glu He He Lys 
185 190 



Leu Val Lys Pro Ala Lys Glu Leu 
195 200 

Gin He Phe Gin Ala He Arg He 
210 215 

Ala Asp Glu Ser He Gin Gin Ala 
225 230 

Arg He Ser Val He Thr Phe His 
245 

Gin Leu Phe Lys Glu Ala Ser Thr 
260 

Phe He Pro Asp Asp Leu Lys Pro 
275 - 280 

Pro He Leu Pro Ser Ala Glu Glu 
290 295 

Ser Ala Lys Leu Arg Val Val Arg 



Lys Lys Lys Gly His Pro Ala Lys 
205 

Glu Val Asn Asp Glu Leu Gly Ala 
220 

Met Asp Met Leu Ala Leu Asp Gly 
235 240 

Ser Leu Glu Asp Arg Leu Thr Lys 
250 255 

Val Glu Val Pro Lys Gly Leu Pro 
265 270 

Lys Met Glu Leu Val Ser Arg Lys 
285 

Leu Glu Ala Asn Asn Arg Ser His 
300 

Lys He His Lys 
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305 



310 



315 



<210> 132 

<211> 332 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 132 

Met Ser Arg lie Leu Asp Asn Glu He Met Gly Asp Glu Glu Leu Val 
1 5 10 15 

Glu Arg Thr Leu Arg Pro Gin Tyr Leu Arg Glu Tyr He Gly Gin Asp 
20 25 30 

Lys Val Lys Asp Gin Leu Gin He Phe He Glu Ala Ala Lys Met Arg 
35 40 45 

Asp Glu Ala Leu Asp His Val Leu Leu Phe Gly Pro Pro Gly Leu Gly 
50 ' 55 60 

Lys Thr Thr Met Ala Phe Val He Ala Asn Glu Leu Gly Val Asn Leu 
65 70 75 80 

Lys Gin Thr Ser Gly Pro Val He Glu Lys Ala Gly Asp Leu Val Ala 
85 90 95 

He Leu Asn Glu Leu Glu Pro Gly Asp Val Leu Phe He Asp Glu He 
100 105 110 

His Arg Leu Pro Met Ser Val Glu Glu Val Leu Tyr Ser Ala Met Glu 
115 120 125 

Asp Phe Tyr He Asp He Met He Gly Ala Gly Glu Gly Ser Arg Ser 
130 135 140 

Val His Leu Glu ,Leu Pro Pro Phe Thr Leu He Gly Ala Thr Thr Arg 
145 150 155 160 

Ala Gly Met Leu Ser Asn Pro Leu Arg Ala Arg Phe Gly He Thr Gly 
165 170 .175 

His Met Glu Tyr Tyr Ala His Ala Asp Leu Thr . Glu He Val Glu Arg 
180 185 190 

Thr* Ala Asp He Phe Glu Met Glu He Thr His Glu Ala Ala Ser Glu 
195 200 205 

Leu Ala Leu Arg Ser Arg Gly Thr Pro Arg He Ala Asn Arg Leu Leu 
210 215 220 

Lys Arg Val Arg Asp Phe Ala Gin He Met Gly Asn Gly Val He Asp 
225 230 235 240 

Asp lie He Thr Asp Lys Ala Leu Thr Met Leu Asp Val Asp His Glu 



245 



250 



"255 
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Gly Leu Asp Tyr Val Asp Gin Lys lie Leu Arg Thr Met lie Glu Met 
260 265 270 

Tyr Ser Gly Gly Pro Val Gly Leu Gly Thr Leu Ser Val Asn lie Ala 
5 275 280 285 

Glu Glu Arg Glu Thr Val Glu Asp Met Tyr Glu Pro Tyr Leu lie Gin 
290 295 300 

10 Lys Gly Phe lie Met Arg Thr Arg Ser Gly Arg Val Ala Thr Ala Lys 
305 310 315 320 

Ala Tyr Glu His Leu Gly Tyr Glu Tyr Ser Glu Lys 
325 330 

15 

<210> 133 
<211> 436 
<212> PRT 
20 <213> Streptococcus pneumoniae 

<400> 133 

Met Ser Met Phe Leu Asp Thr Ala Lys lie Lys Val Lys Ala Gly Asn 
15 10 15 

25 

Gly Gly Asp Gly Met Val Ala Phe Arg Arg Glu Lys Tyr Val Pro Asn 
20 25 30 

Gly Gly Pro Trp Gly Gly Asp Gly Gly Arg Gly Gly Asn Val Val Phe 
30 35 40 45 

Val Val Asp Glu Gly Leu Arg Thr Leu Met Asp Phe Arg Tyr Asn Arg 
50 55 60 

35 His Phe Lys Ala Asp Ser Gly Glu Lys Gly Met Thr Lys Gly Met His 
.65 70 75 80 

Gly Arg Gly Ala Glu Asp Leu Arg Val Arg Val Ser Gin Gly Thr Thr 
85 90 95 

40 

Val Arg Asp Ala Glu Thr Gly Lys Val Leu Thr Asp Leu lie Lys His 
100 105 110 

Gly Gin Glu Phe lie Val Ala His Gly Gly Arg Gly Gly Arg Gly Asn 
45 115 120 125 

lie Arg Phe Ala Thr Pro Lys Asn Pro Ala Pro Glu lie Ser Glu Asn 
130 135 140 

50 Gly Glu Pro Gly Gin Glu Arg Glu Leu Gin Leu Glu Leu Lys He Leu 
145 150 155 160 

Ala Asp Val Gly Leu Val Gly Phe Pro Ser Val Gly Lys Ser Thr Leu 
165 170 175 

55 

Leu Ser Val He Thr Ser Ala Lys Pro Lys He Gly Ala Tyr His Phe 
180 185 190 
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Thr Thr He Val Pro Asn Leu Gly Met Val Arg Thr Gin Ser Gly Glu 
195 200 205 

Ser Phe Ala Val Ala Asp Leu Pro Gly Leu He Glu Gly Ala Ser Gin 
210 215 220 

Gly Val Gly Leu Gly Thr Gin Phe Leu Arg His He Glu Arg Thr Arg 
225 230 235 240 

Val He Leu His He He Asp Met Ser Ala Ser Glu Gly Arg Asp Pro 
245 250 255 

Tyr Glu Asp Tyr Leu Ala He Asn Lys Glu Leu Glu Ser Tyr Asn Leu 
260 265 270 

Arg Leu Met Glu Arg Pro Gin He He Val Ala Asn Lys Met Asp Met 
275 280 285 

Pro Glu Ser Gin Glu Asn Leu Glu Glu Phe Lys Lys Lys Leu Ala Glu 
290 295 300 

Asn Tyr Asp Glu Phe Glu Glu Leu Pro Ala He Phe Pro He Ser Gly 
305 310 315 320 

Leu Thr Lys Gin Gly Leu Ala Thr Leu Leu Asp Ala Thr Ala Glu Leu 
325 330 335 

Leu Asp Lys Thr Pro Glu Phe Leu Leu Tyr Asp Glu Ser Asp Met Glu 
340 345 350 

Glu Glu Ala Tyr Tyr Gly Phe Asp Glu Glu Glu Lys Ala Phe Glu He 
355 360 365 

Ser Arg Asp Asp Asp Ala Thr Trp Val Leu Ser Gly Glu Lys Leu Met 
370 375 380 

Lys Leu Phe Asn Met Thr Asn Phe Asp Arg Asp Glu Ser Val Met Lys 
385 390 395 * 400 

Phe Ala Arg Gin Leu Arg Gly Met Gly Val Asp Glu Ala Leu Arg Ala 
405 ' 410 415 

Arg Gly Ala Lys Asp Gly Asp Leu Val Arg He Gly Lys Phe Glu Phe 
.420 425 430 

Glu Phe Val Asp 
435 



<210> 134 
<211> 172 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 134 

Met Asn Tyr Phe. Asn Val Gly Lys lie Val Asn Thr Gin Gly Leu Gin 
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15 10 15 

Gly Glu Met Arg Val Leu Ser Val Thr Asp Phe Ala Glu Glu Arg Phe 
20 25 30 

5 

Lys Lys Gly Ala Glu Leu Ala Leu Phe Asp Glu Lys Asp Gin Phe Val 
35 40 45 

Gin Thr Val Thr lie Ala Ser His Arg Lys Gin Lys Asn Phe Asp lie 
10 50 55 60 

lie Lys Phe Lys Asp Met Tyr His lie Asn Thr lie Glu Lys Tyr Lys 
65 70 75 80 

15 Gly Tyr Ser Leu Lys Val Ala Glu Glu Asp Leu Asn Asp Leu Asp Asp 

.85 90 95 

Gly Glu Phe Tyr Tyr His Glu He He Gly Leu Glu Val Tyr Glu Gly 
100 105 110 

20 

Asp Ser Leu Val Gly Thr He Lys Glu He Leu Gin Pro Gly Ala Asn 
115 120 125 

Asp Val Trp Val Val Lys Arg Lys Gly Lys Arg Asp Leu Leu Leu Pro 
25 130 135 140 

Tyr lie Pro Pro Val Val Leu Asn Val Asp He Pro Asn Lys Arg Val 
145 150 155 - 160 

30 Asp Val Glu He Leu Glu Gly Leu Asp Asp Glu Asp 

165 170 



<210> 135 
35 <211> 239 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 135 

40 Met Lys He Asp lie Leu Thr Leu Phe Pro Glu Met Phe Ser Pro Leu 
1 5 10 15 

Glu His Ser He Val Gly Lys Ala Arg Glu Lys Gly Leu Leu Asp He 
20 25 30 

45 

Gin Tyr His Asn Phe Arg Glu Asn Ala Glu Lys Ala Arg His Val Asp 
35 40 45 

Asp Glu Pro Tyr Gly Gly Gly Gin Gly Met Leu Leu Arg Ala Gin Pro 
50 50 55 60 

He Phe Asp Ser Phe Asp Ala lie Glu Lys Lys Asn Pro Arg Val He 
65 .70 .75 80 

55 Leu Leu Asp Pro Ala Gly Lys Gin Phe Asp Gin Ala Tyr Ala Glu Asp 

85 90 95 
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Leu Ala Gin Glu Glu Glu Leu lie Phe lie Cys Gly His Tyr Glu Gly 
100 105 110 

Tyr Asp Glu Arg He Lys Thr Leu Val Thr Asp Glu He Ser Leu Gly 
5 115 120 125 

Asp Tyr' Val Leu Thr Gly Gly Glu Leu Ala Ala Met Thr Met He Asp 
130 135 140 

10 Ala Thr Val Arg Leu He Pro Glu Val He Gly Lys Glu Ser Ser His 
145 150 155 160 



15 



Gin Asp Asp Ser. Phe Ser Ser Gly Leu Leu Glu Tyr His Gin Tyr Thr 
165 ' * 170 175 

Arg Pro Tyr Asp Tyr Arg Gly Met Val Val Pro Asp Val Leu Met Ser 

180 185 190 



Gly His His Glu Lys He Arg Gin Trp Arg Leu Tyr Glu Ser Leu Lys 
20 195' 200 205 

Lys Thr Tyr Glu Arg Arg Pro Asp Leu Leu Glu His Tyr Gin Leu Thr 
210 215 220 



25 Val Glu Glu Glu Lys Met Leu Ala Glu He Lys Glu Asn Lys Glu 
225 230 235 



30 . 

<210> 136 
<211> 186 
<212> PRT 
35 <213> Streptococcus pneumoniae 



<400> 136 

Met He Glu Ala Ser Lys Leu Lys Ala Gly Met Thr Phe Glu Thr Ala 
1*5 10 15 

Asp Gly Lys Leu He Arg Val Leu Glu Ala Ser His His Lys Pro Gly 
20 25 30 

Lys Gly Asn Thr lie Met Arg Met Lys Leu Arg Asp Val Arg Thr Gly 
35 40 . 45 

Ser Thr Phe Asp Thr Ser Tyr Arg Pro Glu Glu Lys Phe Glu Gin Ala 
50 55 60 

50 

He He Glu Thr Val Pro Ala Gin Tyr Leu Tyr Lys Met Asp Asp Thr 
65 70 75 80 

Ala Tyr Phe Met Asn Thr Glu Thr Tyr Asp Gin Tyr Glu He Pro Val 
55 85 90 95 

Val Asn Val Glu Asn Glu Leu Leu Tyr He Leu Glu Asn Ser Asp Val 



40 



45 



74 



WO 01/49721 



PCT7US00/35604 



100 105 110 

Lys lie Gin' Phe Tyr Gly Thr Glu Val He Gly Val Thr Val Pro Thr 
115 .120 125 

Thr Val Glu Leu Thr Val Ala Glu Thr Gin Pro Ser He Lys Gly Ala 
130 135 140 

Thr Val Thr Gly Ser Gly Lys Pro Ala Thr Met Glu Thr Gly Leu Val 
145 150 155 160 

Val Asn Val Pro Asp Phe He Glu Ala Gly Gin Lys Leu Val He Asn 
165" 170 175 

Thr Ala Glu Gly Thr Tyr Val Ser Arg Ala 
180 185 



<210> 137 
<211> 523 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 137 

Met Ala Phe Glu Ser Leu Thr Glu Arg Leu Gin Asn Val Phe Lys Asn 
1 5 10 15 

Leu Arg Lys Lys Gly Lys He Ser Glu Ser Asp Val Gin Glu Ala Thr 
20. 25 30 

Lys Glu He Arg Leu Ala Leu Leu Glu Ala Asp Val Ala Leu Pro Val 
35 40 45 

Val Lys Asp Phe He Lys Lys Val Arg Glu Arg Ala Val Gly His Glu 
50 55 60 

Val He Asp Thr Leu Asn Pro Ala Gin Gin He He Lys He Val Asp 
65 70 75 80 

Glu Glu Leu Thr Ala Val' Leu Gly Ser Asp Thr Ala Glu He He Lys 
85 90 / 95 

Ser Pro Lys He Pro Thr He He Met Met Val Gly Leu Gin Gly Ala 
100 105 110 

Gly Lys Thr Thr Phe Ala Gly Lys Leu Ala Asn Lys Leu Lys Lys Glu 
115 120 125 

Glu Asn Ala Arg Pro Leu Met He Ala Ala Asp He Tyr Arg Pro Ala 
130 135 140 

Ala He Asp Gin Leu Lys Thr Leu Gly Gin Gin He Asp Val Pro Val 
145 150 155. 160 

Phe Ala Leu Gly Thr Glu Val Pro Ala Val Glu lie Val Arg Gin Gly 
165 170 175 
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10 



Leu Glu Gin Ma Gin Thr Asn His Asn Asp Tyr Val Leu He Asp Thr 
180 185 190 

Ala Gly Arg Leu Gin He Asp Glu Leu Leu Met Asn Glu Leu Arg Asp 
195 200 205 

Val Lys Thr Leu Ala Gin Pro Asn Glu He Leu Leu Val Val Asp Ala 
210 215 220 

Met He Gly Gin Glu Ala Ala Asn Val Ala Arg Glu Phe Asn Ala Gin 
225 230 235 24.0 



Leu Glu Val Thr Gly Val He Leu Thr Lys He Asp Gly .Asp Thr Arg 
15 245 250 255 

Gly Gly Ala Ala Leu Ser Val Arg His He Thr Gly Lys Pro He Lys 
260 265 270 

20 Phe Thr Gly Thr Gly Glu Lys He Thr Asp He Glu Thr Phe His Pro 
275 280 285 



25 



Asp Arg Met Ser Ser Arg He Leu Gly Met Gly Asp Met Leu Thr Leu 
290 295 300 

He Glu Lys Ala Ser Gin Glu .Tyr Asp Glu Gin Lys Ala Leu Glu Met 

305 310 315 320 



Ala Glu Lys Met Arg Glu Asn Thr Phe Asp Phe Asn Asp Phe He Asp 
30 325 330 335 

Gin Leu Asp Gin Val Gin Asn Met Gly Pro Met Glu Asp Leu Leu Lys 
340 345 350 

35 Met lie Pro Gly Met Ala Asn Asn Pro Ala Leu Gin Asn Met Lys Val 
355 360 365 



40 



Asp Glu Arg Gin He Ala Arg Lys Arg Ala He Val Ser Ser Met Thr 
370 375 380 

Pro Glu Glu Arg Glu Asn Pro Asp Leu Leu Asn Pro Ser Arg Arg Arg 
385 390 395 400 



Arg He Ala Ala Gly Ser Gly Asn Thr Phe Val Glu Val Asn Lys Phe 
45 405 410 415 

He Lys Asp Phe Asn Gin Ala Lys Gin Leu Met Gin Gly Val Met Ser 

420 425' * 430 

.50 Gly Asp Met Asn Lys Met Met Lys Gin Met Gly lie Asn Pro Asn Asn 
435 440 445 



55 



Leu Pro Lys Asn Met Pro Asn Met Gly Gly Met Asp Met Ser Ala Leu 
450 455 460 

Glu Gly Met Met Gly Gin Gly Gly Met Pro Asp Leu Ser Ala Leu Gly 

465 470 475 480 
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Gly Ala Gly Met Pro Asp Met Ser Gin Met Phe Gly Gly Gly Leu Lys 
485 490 495 

Gly Lys He Gly Glu Phe Ala Met Lys Gin Ser Met Lys Arg Met Ala 
500 505 510 

Asn Lys Met Lys Lys Ala Lys Lys Lys Arg Lys 
515 520 



<210> 138 
<211> 281 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 138 

Met Tyr Leu He Glu He Leu Lys Ser He Phe Phe Gly He Val Glu 
1 5 . 10 .■ 15 

Gly lie Thr Glu Trp Leu Pro lie Ser Ser Thr Gly His Leu He Leu 
20 25 30 

Ala Glu Glu Phe lie Gin Tyr Gin Asn Gin Asn Glu Ala Phe Met Ser 
35 40 45 

Met Phe Asn Val Val lie Gin Leu Gly Ala He Leu Ala Val Met Val 
50 55 60 

He Tyr Phe Asn Lys Leu Asn. Pro Phe Lys Pro" Thr Lys Asp Lys Gin 
65 70 75 80 

Glu Val Arg Lys Thr Trp Arg Leu Trp Leu Lys Val Leu lie Ala Thr 
85 90 95 

Leu Pro Leu Leu Gly Val Phe Lys Phe Asp Asp Trp Phe Asp Thr His 
100 105 110 

Phe His Asn Met Val Ser Val Ala Leu Met Leu lie lie Tyr Gly Val 
* 115 120 125 

Ala Phe lie Tyr Leu Glu Lys Arg Asn Lys Ala Arg Ala lie Glu Pro 
130 135 140 

Ser Val Thr Glu Leu Asp Lys Leu Pro Tyr Thr Thr Ala- Phe Tyr He 
145 150 155 160 

Gly Leu Phe Gin Val Leu Ala Leu Leu" Pro Gly Thr Ser Arg Ser Gly 
165 170 175 

Ala Thr lie Val Gly Gly Leu Leu Asn Gly Thr Ser Arg Ser Val Val 
180 185 190 

Thr Glu Phe Thr Phe Tyr Leu Gly lie Pro Val Met Phe Gly Ala Ser 
195 200 205 

Ala Leu Lys He Phe Lys Phe Val Lys Ala Gly Glu Leu Leu Ser Phe 
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210 215 

Gly Gin Leu Phe Leu Leu Leu Val 
225 230 

Ser Met Val Ala lie Arg Phe Leu 
245 

Phe Thr Leu Phe Gly Lys Tyr Arg 
260 

Leu Tyr Ser Phe Val Arg Leu Phe 
275 280 



220 

Ala Met Gly Val Ala Phe Ala Val 
235 240 

Thr Ser Tyr Val Lys Lys His Asp 
250 255 

lie Val Leu Gly Ser Val Leu Leu 
265 270 

Val 



<210> 139 
<211> 429 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 139 

Met Gly Leu Phe Asp Arg Leu Phe Gly Lys Lys Glu Glu Pro Lys lie 
15 10 15 

Glu Glu Val Val Lys Glu Ala Leu Glu Asn Leu Asp Leu Ser Glu Asp 
20 25 30 

lie Glu Pro Ala Phe Thr Glu Ala Glu Glu Val Ser Gin Glu Glu Ala 
35 40 45 

Glu Val Glu Ser Ser Glu Glu Ser Val Phe Gin Glu Glu Asp Ser Gin 
50 55 60 

Asp Thr Val Glu Glu Asn Leu Asp Leu Glu Pro Val Val Glu Val Ser 
65 70 75 80 

Gin Glu Glu Val Glu Glu Phe Pro Asn Ser Gin Glu Val Thr Glu Glu 
85 90 95 

Glu Lys Leu Glu His Glu Gly Thr Val Glu Glu Asn Asn Phe Glu Val 
100 m 105 110 

Leu Glu Pro Glu Ala Pro Gin Thr Glu Glu Thr Val Gin Glu Lys Tyr 
115 120 125 

Asp Arg Ser Leu Lys Lys Thr Arg Thr Gly Phe Gly Ala Arg Leu Asn 
130 135 140 

Ala Phe Phe Ala Asn Phe Arg Ser Val Asp Glu Glu Phe Phe Glu Glu 
145 150 155 160 

Leu Glu Glu Leu Leu -lie Met Ser Asp Val Gly Val Gin Val Ala Ser 
165 170 175 

Asn Leu Thr Glu Glu Leu Arg Tyr Glu Ala Lys Leu Glu Asn Ala Lys 
180 185 190 
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Lys Pro Asp Ala Leu Arg Arg Val lie lie Glu Lys Leu Val Glu Leu 

195 200 205 

Tyr Glu Lys Asp Gly Ser Tyr Asp Glu Ser lie His Phe Gin Asp Asn 

210 215 220 



Leu Thr Val Met Leu Phe Val Gly Val Asn Gly Val Gly Lys Thr Thr 
225 ' 230 235 240 

Ser lie Gly Lys Leu Ala His Arg Tyr Lys Arg Ala Gly Lys Lys Val 
245 250 255 

Met Leu Val Ala Ala Asp Thr Phe Arg Ala Gly Ala Val Ala Gin Leu 
260 265 270 



Ala Glu Trp Gly Arg Arg Val Asp 
275 280 

Lys Ala Asp Pro Ala Ser Val Val 
290 295 

Ala Glu Gly He Asp He Leu Met 
305 310 

Asn Lys Asp Asn Leu Met Ala Glu 

325' 



Val Pro Val Val Thr Gly Pro Glu 
285 

Phe Asp Gly Met Glu Arg Ala Val 
300 

He Asp Thr Ala Gly Arg Leu Gin 
315 320 

Leu Glu Lys He Gly Arg He He 
330 335 



Lys Arg Val Val Pro Glu Ala Pro His Glu Thr Phe Leu Ala Leu Asp 
340 345 350 

Ala Ser Thr Gly Gin Asn Ala Leu Val .Gin Ala Lys Glu Phe Ser Lys 
355 360 365 

He Thr Pro Leu Thr Gly He Val Leu Thr Lys He Asp Gly Thr Ala 
370 375 380 



Arg Gly Gly Val Val Leu Ala He 
385 * 390 

Lys Leu He Gly Phe Gly Glu Lys 
405 

Ser Glu Asn Phe Met Lys Gly Leu 
420 



Arg Glu Glu Leu Asn He Pro Val 
395 400 

He Asp Asp He Gly Glu. Phe Asn 
410 415 

Leu Glu Gly Leu He 
425 



<210> 140 
<211> 165 
<212> £RT 

<213> Streptococcus pneumoniae 
<400> 1.40 

Met Tyr He Glu Met Val Asp Glu Thr Gly Gin Val Ser Lys Glu Met 
15 10 15 



Leu Gin Gin Thr Gin Glu He Leu Glu Phe Ala Ala Lys Lys Leu Gly 
20 25 30 
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Lys Glu Asp Lys Glu Met Ala Val Thr Phe Val Thr Asn Glu Arg Ser 
35 40 45 

His Glu Leu Asn Leu Glu Tyr Arg Asp Thr Asp Arg Pro Thr Asp Val 
50 55 60 

lie Ser Leu Glu Tyr Lys Pro Glu Leu Glu lie Ala Phe Asp Glu Glu 
65 70 75 80 

Asp Leu Leu Glu Asn Pro Glu Leu Ala Glu Met Met Ser Glu Phe Asp 
85 90 . 95 

Ala Tyr lie Gly Glu Leu Phe lie Ser He Asp Lys .Ala His Glu Gin 
100 105 110 

Ala Glu Glu Tyr Gly His Ser Phe Glu Arg Glu Met Gly Phe Leu Ala 
11'5 120 125 

Val His Gly Phe Leu His lie Asn Gly Tyr Asp His Tyr Thr Pro Glu 
130 135 140 

Glu Glu Ala Glu Met Phe Gly Leu Gin Glu Glu He Leu Thr Ala Tyr 
145 150 155 160 

Gly Leu Thr Arg Gin 
165 



<210> 141 
<211> 255 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 141 

Met Ser He Arg Val He He Ala Gly Phe Lys Gly Lys Met Gly Gin 
15 10 15 

Ala Ala Cys Gin Met Val Leu Thr Asp Pro Asp Leu Asp Leu Val Ala 
20 25 30. 

Val Leu Asp Pro Phe Glu Ser Glu Ser Glu Trp Gin Gly He Pro Val 
35 ' 40 45 

Phe Lys Asp Lys Ala Asp Leu Ala Gly Phe Glu Ala Asp Val Trp Val 
50 55 60 

Asp Phe Thr Thr Pro Ala Val Ala Tyr Glu Asn Thr Arg Phe Ala Leu 
65 .70 75 80 

Glu Asn Gly Phe Ala Pro Val Val Gly Thr Thr Gly Phe Thr Ser Glu 
85 90 95 

Glu He Ala Glu Leu Lys Glu Phe Ser Arg Ala Gin Asp Leu Gly Gly 
' 100 105 110 

Leu lie Ala Pro Asn Phe Ala Leu Gly Ala Val Leu Leu Met Gin Phe 
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115 120 125 

Ala Thr Gin Ala Ala Lys Tyr Phe Pro Asn Val Glu He He Glu Leu 
130 135 140 

His His Asp Lys Lys Lys Asp Ala Pro Ser Gly Thr Ala He Lys Thr 
145 150 155 160 



Ala Glu Leu Met Ala Glu Val 
10 165 

Asp Glu Glu Glu Leu He Ala 
180 

15 Met Arg He His Ser Val Arg 
195 

Val He Phe Gly Asn Gin Gly 
210 215 

20 

Tyr Asp Arg He Ser Phe Met 
225 230 

Val Val Lys Arg His Glu Leu 
25 245 



Arg Glu Ser He Gin Gin' Gly Ala Ala 
170 175 

Gly Ala Arg Gly Ala Asp Phe Asp Gly 
185 190 

Leu Pro Gly Leu Val Ala His Gin Glu 
200 205 

Glu Gly Leu Thr Leu Arg His Asp Ser 
220 

Thr Gly Val Asn Leu Gly He Lys Glu 
235 240 

Val Tyr Gly Leu Glu His Leu Leu 
250 255 



<210> 142 
<211> 91 
30 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 142 

Met Ala Asn Lys Gin Asp Leu He Ala. Lys Val Ala Glu Ala Thr Glu 
35 1 5 10 15 

Leu Thr Lys Lys Asp Ser Ala Ala Ala Val Glu Ala Val Phe Ala Ala 
20 25 30 - 

40 Val Ala Asp Tyr Leu Ala Ala Gly Glu Lys Val Gin Leu He Gly Phe 
35 40 45 

Ser Asn Phe Glu Val Arg* Glu Arg Ala Glu Arg Lys Gly Arg Asn Pro 
50 55 60 

45 

Gin Thr Gly Lys Glu Met Thr He Ala Ala Ser Lys Val Pro Ala Phe- 
65 70 ... 75 80 

Lys Ala Gly Lys Ala Leu Lys Asp Ala Val Lys 
50 85 90 



<210> 143 
<211> 306 
55 <212> PRT 

<213> Streptococcus pneumoniae 
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<400> 143 

Met Thr Lys Thr Ala Phe Leu Phe Ala Gly Gin Gly Ala Gin Tyr Leu 
15 10 15 

Gly Met Gly Arg Asp Phe Tyr Asp Gin Tyr Pro lie Val Lys Glu Thr 
20 25 30 

He Asp Arg Ala Ser Gin Val Leu Gly Tyr Asp Leu Arg Tyr Leu He 
35 40 45 

Asp Thr Glu Glu Asp Lys Leu Asn .Gin Thr Arg Tyr Thr Gin Pro Ala 
50 55 60 

He Leu Ala Thr Ser Val Ala lie Tyr Arg Leu Leu Gin Glu Lys Gly 
65 70 75 80 



Tyr Gin Pro Asp Met Val Ala Gly Leu Ser Leu Gly Glu Tyr Ser Ala 
85 90 95 

Leu Val Ala Ser Gly Ala Leu Asp Phe Glu Asp Ala Val Ala Leu Val 
100 105 110 

Ala Lys Arg Gly Ala Tyr Met Glu Glu Ala Ala Pro Ala Asp Ser Gly 
115 120 125 

Lys Met Val Ala Val Leu Asn Thr Pro Val Glu Val He Glu Glu Ala 
130 135 140 

Cys Gin, Lys Ala Ser Glu Leu Gly Val Val Thr Pro Ala Asn Tyr Asn 
145 ' 150 155 160 

Thr Pro Ala Gin He Val He Ala Gly Glu Val Val Ala Val Asp Arg 
165 . 170 175 

Ala Val Glu Leu Leu Gin Glu Ala Gly Ala Lys Arg Leu He Pro Leu 
180 185 190 

Lys Val Ser Gly Pro Phe His Thr Ala Leu Leu Glu Pro Ala Ser Gin 
195 200 205 

Lys Leu Ala .Glu Thr Leu Ala Gin Val Ser Phe Ser Asp Phe Thr Cys 
210 215 220 

Pro Leu Val Gly Asn Thr Glu Ala Ala Val Met Gin Lys Glu Asp He 
225 230 235 *• 240 

Ala Gin Leu Leu Thr Arg Gin Val Lys -Glu Pro Val Arg Phe Tyr Glu 
245 250. 255 

Ser He Gly Val Met. Gin Glu Ala Gly He Ser Asn Phe He Glu He 
260 . 265 270 

Gly Pro Gly Lys Val Leu Ser Gly Phe Val Lys Lys He Asp Gin Thr 
275 280 285 



Ala His Leu Ala His Val Glu Asp Gin Ala Ser Leu Val Ala Leu Leu 
290 295 300 
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Glu Lys 
305 



<210> 144 
<211> 243 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 144 

Met Lys Leu- Glu His Lys Asn lie Phe lie Thr Gly Ser Ser Arg Gly 
1 5 10 15 

He Gly Leu Ala He Ala His Lys Phe Ala Gin Ala Gly Ala Asn He 
20 25 '30 

Val Leu Asn Ser Arg Gly Ala He Ser Glu Glu Leu Leu Ala Glu Phe 
35 40 45 

Ser Asn Tyr Gly He Lys Val Val Pro lie Ser Gly Asp Val Ser Asp 
50 * 55 60 

Phe Ala Asp Ala Lys Arg Met He Asp Gin Ala He Ala Glu Leu Gly 
65 70 75 80 

Ser Val Asp Val Leu Val Asn Asn Ala Gly He Thr Gin Asp Thr Leu 
85 90 95 

Met Leu Lys Met Thr Glu. Ala Asp Phe Glu Lys Val Leu Lys Val Asn 
100 - . 105 110 

Leu Thr Gly Ala Phe Asn Met Thr Gin Ser Val Leu Lys Pro Met Met 
115 120 125 

Lys Ala Arg Glu Gly Ala He He Asn Met Ser Ser Val Val Gly Leu 
130 135 140 

Met Gly Asn He Gly Gin Ala Asn Tyr Ala Ala Ser Lys Ala Gly Leu 
145 150 . 155 160 

He Gly Phe Thr Lys Ser Val Ala Arg Glu Val Ala Ser Arg Asn He 
165 170 175 

Arg Val Asn Val He Ala Pro Gly Met He Glu Ser Asp Met Thr Ala 
180 185 190 

He Leu Ser Asp Lys He Lys Glu Ala Thr Leu Ala Gin lie Pro Met 
195 200 205 

Lys Glu Phe Gly Gin Ala Glu Gin Val Ala Asp Leu Thr Val Phe Leu 
210 215 220 

Ala Gly Gin Asp Tyr Leu Thr Gly Gin Val Val Ala He Asp Gly Gly 
225 230 235 240 

Leu Ser Met 
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<210> 145 
<211> 276 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 145 

Met Gly Val Lys Lys Lys Leu Lys Leu Thr Ser Leu Leu Gly Leu Ser 
1 5 .10 15 

Leu Leu lie Met Thr Ala Cys Ala Thr Asn Gly Val Thr Ser Asp lie 
20 25 30 

Thr Ala Glu Ser Ala Asp Phe Trp Ser Lys Leu Val Tyr Phe Phe Ala 
35 40 45 

Glu lie lie Arg Phe Leu Ser Phe Asp He- Ser He Gly Val Gly He 
50 55 60 

He Leu Phe Thr Val Leu He Arg Thr Val Leu Leu Pro Val Phe Gin 
65 70 75 80 

Val Gin Met Val Ala Ser Arg Lys Met Gin Glu Ala Gin Pro Arg He 
85 90 95 

Lys Ala Leu Arg Glu Gin Tyr Pro Gly Arg Asp Met Glu Ser Arg Thr 
100 105 110 

Lys Leu Glu Gin Glu Met Arg Lys Val Phe Lys Glu Met Gly Val Arg 
115 120 125 

Gin Ser Asp Ser Leu Trp Pro He Leu He Gin Met Pro Val He Leu 
130 135 140 

Ala Leu Phe Gin Ala Leu Ser Arg Val Asp Phe Leu Lys Thr Gly His 
145 150 155 160 

Phe Leu Trp He Asn Leu Gly Ser Val Asp Thr Thr Leu Val Leu Pro 
165 170 175 

He Leu Ala Ala Val Phe Thr Phe Leu Ser Thr Trp Leu Ser Asn Lys 
180 185 190 

Ala Leu Ser Glu Arg Asn Gly Ala Thr Thr Ala Met Met Tyr Gly He 
195 200 205 . 

Pro Val Leu He Phe He Phe Ala Val Tyr Ala Pro Gly Gly Val Ala 
210 215 220 

Leu Tyr Trp Thr Val Ser Asn Ala Tyr Gin Val Leu Gin Thr Tyr Phe 
225 230 235 240 

Leu Asn Asn Pro Phe Lys He He Ala Glu Arg Glu Ala Val Val Gin 
245 250 255 
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Ala Gin Lys Asp Leu Glu Asn Arg Lys Arg Lys Ala Lys Lys Lys Ala 
260 265 270 

Gin Lys Thr Lys 
5 275 



<210> 146 
<211> 409 
10 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 146 

Met Lys lie Ser Lys Arg His Leu Leu Asn Tyr Ser lie Leu lie Pro 
15 1 5 10 15 

Tyr Leu Leu Leu Ser lie Leu Gly Leu He Val Val Tyr Ser Thr Thr 
20 25 30 

20 Ser Ala He Leu He Glu Glu Gly Lys Ser Ala Leu Gin Leu Val Arg 
35 40 45 

Asn Gin Gly He Phe Trp He Val Ser Leu He Leu He Ala Leu He 
50 55 60' 

25 

Tyr Lys Leu Arg Leu Asp Phe Leu Arg Asn Glu Arg Leu He He' Leu 
65 70 75 80 

Val He Leu He Glu Met Leu Leu Leu Phe Leu Ala Arg Phe He Gly 
30 ' 85 90 95 

He Ser Val Asn Gly Ala Tyr Gly Trp He Ser Val Ala Gly Val Thr 
100 105 110 

35 He Gin Pro Ala Glu Tyr Leu Lys He He He He Trp Tyr Leu Ala 
115 120 125 

His Arg Phe Ser Lys Gin Gin Glu Glu He Ala Thr Tyr Asp Phe Gin 
130 135 140 

40 

Val Leu Thr Gin Asn Gin Trp Leu Pro Arg Ala Phe Asn Asp Trp Arg 
145 150 155 160 

Phe Val Leu Leu Val Leu He Gly Ser Leu Gly He Phe Pro Asp Leu 
45 165 170 175 

Gly Asn Ala Thr lie Leu Val Leu Val - Ser Leu He Met Tyr Thr Val 
180 185 190 

50 Ser Gly He Ala Tyr Arg Trp Phe Ser Thr He Leu Ala Leu Val Ser 
195 200 205 

Ala Thr Ser Val Phe Val Leu Thr Thr He Ser Leu He Gly Val Glu 
210 215 220 

55 

Thr Phe Ser Lys He Pro Val Phe Gly Tyr Val Ala Lys Arg Phe Ser 
225 230 235 240 
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Ala Phe Phe Asn Pro Phe Ala Asp Arg Ala Asp Ala Gly His Gin Leu 
245 250 255 

Ala Asn Ser Tyr Phe Ala Met Val Asn Gly Gly Trp Phe Gly Leu Gly 
260 265 270 

Leu Gly Asn Ser lie Glu Lys Arg Gly Tyr Leu Pro Glu Ala His Thr 
275 280 285 

Asp Phe Val Phe Ser lie Val He Glu Glu Phe Gly Phe Val Gly Ala 
290 295 300 

Ser Leu He Leu Ala Leu Leu Phe Phe Met He Leu Arg He He Leu 
305 310 315 320 

Val Gly He Arg Ala Glu Asn Pro Phe Asn Ala Met Val Ala Leu Gly 
325 330 335 

Val Gly Gly Met Met Leu Val Gin Val Phe Val Asn He Gly Gly He 
340 345 350 

Ser Gly Leu He Pro Ser Thr Gly Val Thr Phe Pro Phe Leu Ser Gin 
355 360 365 

Gly Gly Asn Ser Leu Leu Val Leu Ser Val Ala Val Ala Phe Val Leu 
370 375 380 

Asn He Asp Ala Ser Glu Lys Arg Ala Lys Leu Tyr Arg Glu Leu Glu 
385 390 395 400 



Asn Gin Pro Met Asn Leu Leu Leu Lys 
405 



<210> 147 
<211> 419 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 147 

Met Leu Gly He Leu Thr Phe He 
1 5 

Val His* Glu Phe Gly His Phe Tyr 
20 

Val Arg Glu Phe Ala He Gly Met 

35 40 

Gly Lys Asp Gly Thr Ala Tyr Thr 
50 55 

Tyr Val Arg Met Ala Gly Trp Gly 
65 70 

Gly Thr Pro Val Ser Leu Thr Leu 



Leu Val Phe Gly He He Val Val 
10 15 

Phe Ala Lys Lys Ser Gly He Leu 
25 30 

Gly Pro Lys lie Phe Ala His He . 

45 • 

He Arg He Leu Pro Leu Gly Gly 
60 

Asp Asp Thr Thr Glu He. Lys Thr 
75 * 80 

Ala Asp Asp Gly Lys Val Lys Arg 
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85 



90 



95 



lie Asn Leu Ser Gly Lys Lys Leu Asp Gin Thr Ala Leu Pro Met Gin 
100 105 110 

5 

Val Thr Gin Phe Asp Phe Glu Asp Lys Leu Phe He Lys Gly Leu Val 
115 120 * 125 

Leu Glu Glu Glu Lys Thr Phe Ala Val Asp His Asp Ala Thr Val Val 
10 130 135 140 



15 



Glu Ala Asp Gly Thr Glu Val Arg He Ala Pro Leu Asp Val Gin Tyr 
145 150 155 160 

Gin Asn Ala Thr He Trp Gly Lys Leu He Thr Asn Phe Ala Gly Pro 
165 170 175 



20 



Met Asn Asn Phe He Leu Gly Val Val Val Phe Trp Val Leu He Phe 
180 185 190 

Met Gin Gly Gly Val Arg Asp Val Asp Thr Asn Gin Phe His He Met 
195 200 205 



Pro Gin Gly Ala Leu Ala Lys Val Gly Val Pro Glu Thr Ala Gin He 
25 210 215 220 



30 



Thr Lys He Gly Ser His Glu Val Ser Asn Trp Glu Ser Leu He Gin 

225 230 235 240 

Ala Val Glu Thr Glu Thr Lys Asp Lys Thr. Ala Pro Thr Leu Asp Val 

245 250 255 



35 



Thr He Ser Glu Lys Gly Ser Asp Lys Gin Val Thr Val Thr Pro Glu 
260 265 270 

Asp Ser Gin Gly Arg Tyr Leu Leu Gly Val Gin Pro Gly Val Lys Ser 
275 280 285 



Asp Phe Leu Ser Met Phe Val Gly Gly Phe Thr Thr Ala Ala Asp Ser 
40 290 295 300 



45 



Ala Leu Arg He Leu Ser Ala Leu Lys Asn Leu He Phe Gin Pro Asp 

305 310 315 320 

Leu Asn Lys Leu Gly Gly Pro Val Ala He Phe. Lys Ala Ser Ser Asp 

325 330 335 



50 



Ala Ala Lys Asn Gly He Glu Asn lie" Leu Tyr Phe Leu Ala Met He 
340 345 350 

Ser He Asn He Gly He Phe Asn ieu He Pro lie Pro Ala Leu Asp 
355 360 365 



Gly Gly Lys lie Val Leu Asn He Leu Glu Ala He Arg Arg Lys Pro 
55 370 375 380 



Leu Lys Gin Glu He Glu Thr Tyr Val Thr Leu Ala Gly Val Val lie 
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385 



390 



395 



400 



Met Val Val Leu Met He Ala Val Thr Trp Asn Asp He Met Arg Leu 
405 410 415 



Phe Phe Arg 



10 <210> 148 
<211> 197 
<212> PRT 

<213> Streptococcus pneumoniae 
15 <400> 148 

Met Tyr Ala Tyr Leu Lys Gly lie He Thr Lys He Thr Ala Lys Tyr 
15 10 15 

He Val Leu Glu Thr Asn Gly He Gly Tyr He Leu His Val Ala Asn 
20 20 * 25 30 

Pro Tyr Ala Tyr Ser Gly Gin Val Asn Gin Glu Ala Gin He Tyr Val 
35 40 45 

25 His Gin Val Val Arg Glu Asp Ala His Leu Leu Tyr Gly Phe Arg Ser 
50 .55 60 

Glu Asp Glu Lys Lys Leu Phe Leu Ser Leu He Ser Val Ser Gly He 
65 70 75 80 

30 

Gly Pro Val Ser Ala Leu Ala He He Ala Ala Asp Asp Asn Ala Gly 
85 90 ' 95 

Leu Val Gin Ala He Glu Thr Lys Asn He Thr Tyr Leu Thr Lys Phe 
35 100 105 110 

Pro Lys He Gly Lys Lys Thr Ala Gin Gin Met Val Leu Asp Leu Glu 
115 120 125 

40 Gly Lys Val Val Val Ala Gly Asp Asp Leu Pro Ala Lys Val Ala Val 
130 135 140 ' 

Gin Ala Ser Ala Glu Asn Glh Glu Leu Glu Glu Ala Met Glu Ala Met 
145 150 155 160 

45 

Leu Ala Leu Gly Tyr Lys Ala Thr Glu Leu Lys Lys He Lys Lys Phe 
165 ..170 175 

Phe Glu Gly Thr Thr Asp Thr Ala Glu Asn Tyr He. Lys Ser Ala Leu 
50 180 185 190 

• Lys Met Leu Val Lys 
195 

55 

<210> 149 
<211> 257 
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<212> PRT 

<213> Streptococcus pneumoniae 
<400> 149 

5 Met Lys Asn Asn Arg He Leu Ala Leu Ser Gly Asn Asp He Phe Ser 
1 5 10 15 

Gly Gly Gly Leu Ser Aia Asp Leu Ala Thr Tyr Thr Leu Asn Gly Leu 
20 25 30 

10 

His Gly Phe Val Ala Val Thr Cys Leu Thr Ala Leu Thr Glu Lys Gly 
35 40 45 

Phe Glu Val Phe Pro Thr Asp Asp Thr He Phe Gin His Glu Leu Asp 
15 50 55 60 

Ser Leu Arg Asp Val Glu Phe Gly Gly He Lys He Gly Leu Leu Pro 
65 70 75 80 

20 Thr Val Ser Val Ala Glu Lys Ala Leu Asp Phe He Lys Gin Arg Pro 

85 90 95 

Gly Val Pro Val Val Leu Asp Pro Val Leu Val Cys Lys Glu Thr His 
100 105 110 

25 

Asp Val Ala Val Ser Glu Leu Cys Gin Glu Leu He Arg Phe Phe Pro 
115 120 125 

Tyr Val Ser Val He Thr Pro Asn Leu Pro Glu Ala Glu Leu Leu Ser 
30 130 135 140 

Gly Gin Glu He Lys Thr Leu Glu Asp Met Lys Thr Ala Ala Gin Lys 
145 150 155 160 

35 Leu His Asp Leu Gly Ala Pro Ala Val He lie Lys Gly Gly Asn Arg 

165 170 175 

Leu Ser Gin Asp Lys Ala Val Asp Val Phe Tyr .Asp Gly Gin Thr Phe 
180 i85 190 

40 

Thr He Leu Glu Asn Pro Val He Gin Gly Gin Asn Ala Gly Ala Gly 
195 200 205 

Cys Thr Phe Ala Ser Ser He Ala Ser His Leu Val Lys Gly Asp Lys 
45 210 215 220 

Phe Leu Pro Ala Val Glu Ser Ser Lys.:Ala Phe Val Tyr Arg Ala He 
225 230 235 240 

50 Ala Gin Ala Asp Gin Tyr Gly Val Arg Gin Tyr Glu Ala Asn Lys Asn 

245 250 255 

Asn 

55 

<210> 150 
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<211> 412 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 150 

Met He Glu Thr Glu Lys Lys Glu Glu Arg Val Leu Leu He Gly Val 
1 5 10 15 

Glu Leu Gin Gly Met Asp Ser Phe Asp Leu Ser Met Glu Glu Leu Ala 
20 25 30 

Ser Leu Ala Lys Thr Ala Gly Ala Val Val Val Asp Ser Tyr Arg Gin 
35 40 45 

Lys Arg Glu Lys Tyr Asp Ser Lys Thr Phe Val Gly Ser Gly Lys Leu 
. 50 55 60 

Glu Glu He Ala Leu Met Val Asp Ala Glu Glu He Thr Thr Val He 
65 70 75 80 

Val Asn Asn Arg Leu Thr Pro Arg Gin Asn Val Asn Leu Glu Glu Val 
85 90 95 

Leu Gly Val Lys Val He Asp Arg Met Gin Leu He Leu Asp He Phe 
100 105 110 

Ala Met Arg Ala Arg Ser His Glu Gly Lys Leu Gin Val His Leu Ala 
115 120 125 

Gin Phe Lys Tyr Leu Leu Pro Arg Leu Val Gly Gin Gly He Met Leu 
130 135 140 

Ser Arg Gin Ala Gly Gly He Gly Ser Arg Gly Pro Gly Glu Ser Gin 
145 150 155 160 

Leu Glu Leu Asn Arg Arg Ser Val Arg Asn Gin He Thr Asp He Glu 
165 170 175 

Arg Gin Leu Lys Val Val Glu Lys Asn Arg Ala Thr Val Arg Glu Lys 
180 185 190 

Arg Leu Glu Ser Ser Thr Phe Lys He Gly Leu He Gly Tyr Thr Asn 
195 200 205 

Ala Gly Lys Ser Thr He Met Asn He Leu Thr Ser Lys Thr Gin Tyr 
210 215 220 

Glu Ala Asp Glu Leu Phe Ala Thr Leu Asp Ala Thr Thr Lys Ser He 
225 230 235 , 240 

His Leu Gly Gly Asn Leu Gin Val Thr Leu Thr Asp Thr Val Gly Phe 
245 250 255 

He Gin Asp Leu Pro Thr Glu" Leu Val Ser Ser Phe Lys Ser Thr Leu 
260 265 270 

Glu Glu Ser Lys His Val Asp Leu Leu. Val His Val He Asp Ala Ser 
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275 280 285 

Asn Pro Tyr His Glu Glu His Glu Lys Thr Val Leu Ser lie Met Lys 
290 ■' 295 300 

Asp Leu Asp Met Glu Asp lie Pro His Leu Thr Leu Tyr Asn Lys Ala 
305 310 315 320 

Asp Leu Val Glu Asp Phe Thr Pro Thr Gin Thr Pro Tyr Thr Leu He 
325 330 335 

Ser Ala Lys Ser Glu Asp Ser Arg Glu Asn Leu Gin Ala Leu Leu Leu 
340 345 350 

Asp Lys He Lys Glu He Phe Glu Ala Phe Thr Leu Arg Val Pro Phe 
355 .360 365. 

Ser Lys Ser Tyr Lys He His Asp Leu Glu Ser Val Ala He Leu Glu 
370- . .375 380 

Glu Arg Asp Tyr Gin Glu Asp Gly- Glu Val He Thr Gly Tyr He Ser 
385 390 395 400 

Glu Lys Asn Lys Trp Arg Leu Glu Glu Phe Tyr Asp 
405 410 



<210> 151 
<211> 160 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 151 

Met Ala Glu Lys Thr Tyr Pro Met Thr Leu Glu Glu Lys Glu Lys Leu 
1 5 10 15 

Glu Lys Glu Leu Glu Glu Leu Lys Leu Val Arg Arg Pro Glu Val Val 
20 25 30 

Glu Arg He Lys He Ala Arg Ser Tyr Gly Asp Leu Ser Glu Asn Ser 
35 . 40 45 ■ 

Glu Tyr Glu Ala Ala Lys Asp- Glu Gin Ala Phe Val Glu Gly Gin He 
50 55 '60 

Ser Ser Leu Glu Thr Lys He Arg Tyr Ala Glu He Val Asn Ser Asp 
65 70 - - 75 80 

Ala Val Ala Gin Asp Glu Val Ala He Gly Lys Thr Val Thr He Gin 
85 90 95 

Glu He Gly Glu Asp Glu Glu Glu Val Tyr He He Val Gly Ser Ala 
100 105 110 

Gly Ala Asp Ala Phe Ala Gly Lys Val Ser Asn Glu Ser - Pro He Gly 
115 120 * 125 
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Gin Ala Leu lie Gly Lys Lys Thr Gly Asp Thr Ala Thr lie Glu Thr 
130 135 140 

Pro Val Gly Ser Tyr Asp Val Lys lie Leu Lys Val Glu Lys Thr Ala 
145 150 155 160 



<210> 152 
<211> 189 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 152 

Met Thr Lys Leu Leu Val Gly Leu Gly Asn Pro Gly Asp Lys Tyr Phe 
1 5 10 15 

Glu Thr Lys His Asn Val Gly Phe Met Leu He Asp Gin Leu Ala Lys 
20 25 30 

Lys Gin Asn Val Thr Phe Thr His Asp Lys He Phe Gin Ala Asp Leu 
35 40 45 

Ala Ser Phe Phe Leu Asn Gly Glu Lys He Tyr Leu Val Lys Pro Thr 
50 55 60 

Thr Phe Met Asn Glu Ser Gly Lys Ala Val His Ala Leu Leu Thr Tyr 
65 70 75 80 

Tyr Gly Leu Asp He Asp Asp Leu Leu He He Tyr Asp Asp Leu Asp 
85 90 95 

Met Glu Val Gly Lys He Arg Leu Arg .Ala Lys Gly Ser Ala Gly Gly 
100 105 . 110 

His Asn Gly He Lys Ser He He Gin His He Gly Thr Gin Val Phe 
115 120 125 

Asn Arg Val Lys He Gly He Gly Arg Pro Lys Asn Gly Met Ser Val 
130 135 140 

Val His His Val Leu Ser Lys Phe Asp Arg Asp Glu Tyr He Gly He 
145 150 155 160 

Leu Gin Ser Val Asp Lys Val Asp Asp -Ser Val Asn Tyr Tyr Leu Gin 
165 . 170 175 

Glu Lys Asn Phe Glu Lys Thr Met Gin Arg Tyr Asn Gly 
180 185 



<210> 153 . 
<211> 283 
<212> PRT 

<213> Streptococcus pneumoniae 
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<400> 153 

Met He Leu He Thr Gly Ala Asn 
1 5 

Tyr Leu Leu Asp Glu Arg Asn Glu 
20 

Lys Met Asp He Thr Asn Glu.Glii 
35 40 

Val Lys Pro Thr Leu Val Tyr His 
■ 50 55 



Gly Gin Leu Gly Thr Glu Leu Arg 
10 15 

Glu Tyr Val Ala Val Asp Val Ala 
25 30 

Met Val Glu Lys Val Phe Glu Glu 
45 

Cys Ala Ala Tyr Thr Ala Val Asp 
60 



Ala Ala Glu Asp Glu Gly Lys Glu Leu Asp Phe Ala "He Asn Val Thr 
65 70 75 80 

Gly" Thr Lys Asn yal Ala Lys Ala Ser Glu Lys His Gly Ala Thr Leu 
85 90 95 

Val Tyr lie Ser Thr Asp Tyr Val Phe Asp Gly Lys Lys Pro Val Gly 
100 105 110 

Gin Glu Trp Glu Val Asp Asp Arg Pro Asp Pro Gin Thr Glu Tyr Gly 
115 120 125 

Arg Thr Lys Arg Met Gly Glu Glu Leu Val Glu Lys His Val Ser Asn 
130 135 140 

Phe Tyr He He Arg Thr Ala Trp Val Phe Gly Asn Tyr Gly Lys Asn 
145 150 . • • 155 160 

Phe Val Phe Thr Met Gin Asn Leu Ala Lys Thr His Lys Thr Leu Thr 
165 - 170 175 

Val* Val Asn Asp Gin Tyr Gly Arg Pro Thr Trp Thr Arg Thr Leu Ala 
180 .185 190 

Glu Phe Met Thr Tyr Leu Ala Glu Asn Arg Lys Glu Phe Gly Tyr Tyr 
195 200 205 

His Leu Ser Asn Asp Ala Thr Glu Asp Thr Thr Trp Tyr Asp Phe Ala 
210 215 220 

Val Glu lie Leu Lys Asp Thr Asp Val Glu Val Lys' Pro Val Asp Ser 
225 230 235 240 

Ser Gin Phe Pro Ala Lys Ala Lys Arg Pro Leu Asn Ser Thr Met Ser 
245 250 255 

Leu Ala Lys Ala Lys Ala Thr Gly Phe Val He Pro Thr Trp Gin Asp 
260 265 270 



Ala Leu Gin Glu Phe Tyr Lys Gin Glu Val Arg 
275 280 
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<210> 154 
<211> 407 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 154 

Met Lys Arg Ser Leu Asp Ser Arg Val Asp Tyr Ser Leu Leu Leu Pro 
15 10 15 

Val Phe Phe Leu Leu Val lie Gly Val Val Ala lie Tyr lie Ala Val 
20 25 30 

Ser His Asp Tyr Pro Asn Asn lie Leu Pro lie Leu Gly Gin Gin Val 
35 40 45 

Ala Trp lie Ala Leu Gly Leu Val lie Gly Phe Val Val Met Leu Phe 
50 55 60 

Asn Thr Glu Phe Leu Trp Lys Val Thr Pro Phe Leu Tyr He Leu Gly 
65 70 75 80 

Leu Gly Leu Met He Leu Pro He Val Phe Tyr Asn Pro Ser Leu Val 
85 90 95 

Ala Ser Thr Gly Ala Lys Asn Trp Val Ser He Asn Gly He Thr Leu 
100 105 110 

Phe Gin Pro Ser Glu Phe Met Lys He Ser Tyr He Leu Met Leu Ala 
115 120 125 

Arg Val He Val Gin Phe Thr Lys Lys His Lys Glu Trp Arg Arg Thr 
130 135 140 

Val Pro Leu Asp Phe Leu Leu lie Phe Trp Met He Leu Phe Thr He 
145 150 155 160 

Pro Val Leu Val Leu Leu Ala Leu Gin Ser Asp Leu Gly Thr Ala Leu 
165 170 175 

Val Phe Val Ala He Phe Ser Gly He Val Leu Leu Ser Gly Val Ser 
180 185 190 

Trp Lys lie He He Pro Val Phe Val Thr Ala Val Thr Gly Val Ala 
195 200 205 

Gly Phe Leu Ala He Phe He Ser Lys Asp Gly Arg Ala Phe Leu His 
210 215 220 

Gin He Gly Met Pro Thr Tyr Gin He Asn Arg He Leu Ala Trp Leu 
225 230 235 240 

Asn Pro Phe Glu Phe Ala Gin Thr Thr Thr Tyr Gin Gin Ala Gin Gly 
245 / 250 255 

Gin He Ala He Gly Ser Gly Gly Leu Phe. Gly Gin Gly Phe Asn Ala 



94 



WO 01/49721 



PCT/US00/35604 



260 265 270 

Ser Asn Leu Leu lie Pro Val Arg Glu Ser Asp Met He Phe Thr Val 
275 280 285 

5 

He Ala Glu Asp Phe Gly Phe He Gly Ser Val Leu Val He Ala Leu 
290 295 300 

Tyr Leu Met Leu lie Tyr Arg Met Leu Lys lie Thr Leu Lys Ser. Asn 
10 305 310 315 320 

Asn Gin Phe Tyr Thr Tyr lie Ser Thr Gly Leu He- Met Met Leu Leu 
325 330 335 

15 Phe His He Phe Glu Asn lie Gly Ala Val Thr Gly Leu Leu Pro Leu 
340 345 350 



20 



30 



35 



45 



Thr Gly lie Pro Leu Pro Phe lie Ser Gin Gly Gly Ser Ala He He 
355 360 365 

Ser Asn Leu He Gly Val Gly Leu Leu Leu Ser Met Ser Tyr Gin Thr 
370 375 380 



Asn Leu Ala Glu Glu Lys Ser Gly Lys Val Pro Phe Lys Arg Lys Lys 
25 385 390 395 400 



Val Val Leu Lys, Gin lie Lys 
' 405 



<210> U55 
<211> 202 
<212> PRT 

<213> Streptococcus pneumoniae' 



<400> 155 

Met Gly Lys He lie Gly lie Thr Gly Gly He Ala Ser Gly Lys Ser 
1.5 10 15 

40 Thr Val Thr Asn Phe Leu Lys His Gin Gly Leu Ser Ser Ser Gly Leu 
20 25 30 

Pro Thr. Gin Cys Ser Thr Asn Tyr Arg Lys Pro Gly Gly Arg Leu Phe 
35 40 45 



Glu Ala Leu Val Gin His Phe Gly Gin Glu lie lie Leu Glu Asn Gly 
50 55 • - 60 



Glu Leu Asn Arg Pro Leu lie Ala Ser Leu lie Phe Ser Asn Pro Glu 
50 65 70 75 80 

Glu Gin Lys Trp Ser Asn Gin lie Gin Gly Glu lie lie Arg Glu Glu 
. 85 90 95 

55 Leu Ala Thr Leu Arg Glu Gin Leu Ala Gin Thr Glu Glu He Phe Phe 
100 105 110 



95 



WO 01/49721 



PCT/US00/35604 



Met Asp lie Pro Leu Leu Phe Glu Gin Asp Tyr Ser Asp Trp Phe Ala 
115 120 125 

Glu Thr Trp Leu Val Tyr Val Asp Arg Asp Ala Gin Val Glu Arg Leu' 
5 130 135 140 

Met Lys Arg Asp Gin Leu Ser Lys Asp Glu Ala Glu" Ser Arg Met Ala 
145 150 155 160 

10 Ala Gin Trp Pro Leu Glu Lys Lys Lys Asp Leu Ala Ser Gin Val Leu 

165 170 175 

Asp Asn Asn Gly Asn Gin Asn Gin Leu Leu Asn Gin Val His lie Leu 
180 185 190 

15 

Leu Glu Gly Gly Arg Gin Asp Asp Arg Asp 
195 200 



20 <210> 156 
<211> 419 
<212> PRT 

<213> Streptococcus pneumoniae 
25 <400> 156 

Met Arg Lys lie Val lie Asn Gly Gly Leu Pro Leu Gin Gly Glu lie 
1 5 10 15 ■ 

Thr lie Ser Gly Ala Lys Asn Ser Val Val Ala Leu He Pro Ala He 
30 20 25 30 

He Leu Ala Asp Asp Vai Val Thr Leu Asp Cys Val Pro Asp He Ser 
35 40 45 

35 Asp Val Ala Ser Leu Val Glu He Met Glu Leu Met Gly Ala Thr Val 
50 55 60 

Lys Arg Tyr Asp Asp Val Leu Glu He Asp Pro Arg Gly* Val Gin Asn 
65 70 75 80 

He Pro Met Pro Tyr Gly Lys He Asn Ser Leu Arg Ala Ser Tyr Tyr 
85 90 95 



40 



Phe Tyr Gly Ser Leu Leu Gly Arg Phe Gly Glu Ala Thr Val Gly Leu 
45 100 105 110 

Pro Gly Gly Cys Asp Leu Gly Pro Arg Pro He Asp Leu His Leu Lys 
115 120 125 

50 Ala Phe Glu Ala Met Gly Ala Thr Ala Ser Tyr Glu Gly Asp Asn Met 
130 135 140 

Lys Leu Ser Ala Lys Asp Thr Gly Leu His Gly Ala Ser He Tyr Met 
145 150 155 160 

55 

Asp Thr Val Ser Val Gly Ala Thr He Asn Thr Met He Ala Ala Val 
165 170 175 
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Lys Ala Asn Gly Arg Thr lie lie Glu Asn Ala Ala Arg Glu Pro Glu 
180 185 190 

He He Asp Val Ala Thr Leu Leu Asn Asn Met Gly Ala His He Arg 
195 200 205 

Gly Ala Gly Thr Asn He He He He Asp Gly Val Glu Arg Leu His 
210 215 220 

Gly Thr Arg His Gin Val He Pro Asp Arg He Glu Ala Gly Thr Tyr 
225 230 235 240 

He Ser Leu Ala Ala Ala Val Gly Lys Gly He Arg He Asn Asn Val 
245 250 255 

Leu Tyr Glu His Leu Glu Gly Phe He .Ala Lys Leu Glu Glu Met .Gly 
260 265 , 270 

Val Arg Met Thr Val Ser Glu Asp Ser He Phe Val Glu Glu Gin Ser 
275 280 285 

Asn Leu Lys Ala He Asn He Lys Thr Ala Pro Tyr Pro Gly Phe Ala 
290 295 300 

Thr Asp Leu Gin Gin Pro Leu Thr Pro Leu Leu Leu Arg Ala Asn Gly 
305 310 315 320 

Arg Gly Thr He Val Asp Thr He Tyr Glu Lys Arg Val Asn His Val 
325 330 335 

Phe Glu Leu Ala Lys Met Asp Ala Asp lie Ser Thr Thr Asn Gly His 
340 345 350 

He Leu Tyr Thr Gly Gly Arg Asp Leu Arg Gly Ala Ser Val Lys Ala 
355 360 365 

Thr Asp Leu Arg Ala Gly Ala Ala Leu Val He Ala Gly Leu Met Ala 
370 375 380 

Glu Gly Lys Thr Glu He Thr Asn He Glu Phe He Leu Arg Gly Tyr 
385 390 395 400 

Ser Asp He He Glu Lys Leu Arg Asn Leu Gly Ala Asp He Arg Leu 
405 410 415 



Val Glu Asp 



<210> 157 
<211> 231 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 157 

Met Ser Arg He Glu Phe Ser Pro Ser. Leu Met Thr Met Asp Leu Asp 
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15 10 15 

Lys Phe Lys Glu Gin He Thr Phe Leu Asn Asp Lys Val Ala Ser Tyr 
20 25 30 

His He Asp He Met Asp Gly His Phe Val Pro Asn He Thr Leu Ser 
35 4.0- 45 

Pro Trp Phe He Gin Glu Val Gin Lys He Ser Asp Thr Pro Leu Ser 
50 55 60 

Val His Leu Met Val Thr Asp Pro Thr Phe Trp Val Asp Gin Val Leu 
65 70 75 80 

Asp Leu Gin Cys Glu Tyr He Cys He His Ala Glu Val Leu Asn Gly 
85 90 95 

Leu Ala Phe Arg Leu He Asp Lys He His Asp Ala Gly Leu Lys Ala 
100 105 110 

Gly Val Val Leu Asn Pro Glu Thr Pro Val Ser Thr He. Phe Pro Tyr 
115 120 125 

He Asp Leu Leu Asp Lys Val Thr He Met Thr Val Asp Pro Gly Phe 
130 135 140 

Ala Gly Gin Arg Phe Leu Glu Ser Thr Leu Tyr Lys He Gin Glu Leu 
145 150 155 160 

Arg Gin Leu Arg Val Gin Asn Gly Tyr His Tyr He He Glu Met Asp 
165 170 175 

Gly Ser Ser Ser Arg Lys Thr Phe Lys Gin He Asp Val Ala Gly Pro 
180 185 190 

Asp He Tyr Val He Gly Arg Ser Gly Leu Phe Gly Leu Asp Asp Asp 
195 200 205 

lie Ala Lys Ala Trp Asp He Cys Ser Arg Asp Tyr Glu Glu Met Thr 
210 . 215 220 

Gly Lys Thr Met Pro He Lys 
225 230 



<210> 158 
<211> 374 - 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 158 

Met Arg Asn Met Ala Leu Thr Ala Gly He Val Gly Leu Pro Asn Val 
1 5 10 15 

Gly Lys Ser Thr Leu Phe Asn Ala He Thr Lys Ala Gly Ala Glu Ala 
20 25 30 
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10 



15 



Ala Asn Tyr Pro Phe Ala Thr lie Asp Pro Asn Val Gly Met Val Glu 
35 40 45 

Asp Pro Asp Glu Arg Leu Gin Lys Leu Thr Glu Met lie Thr Pro Lys 
50 55 60 

Lys Thr Val Pro Thr Thr Phe Glu Phe Thr Asp lie Ala Gly lie Val 
65 70 75 80 

Lys Gly Ala Ser Lys Gly Glu Gly Leu Gly Asn Lys Phe Leu Ala Asn 
85 90 95 

He Arg Glu Val Asp Ala He Val His Val Val Arg Ala Phe Asp Asp 
100 105 HO 

Glu Asn Val Met Arg Glu Gin Gly Arg Glu Asp Ala Phe Val Asp Pro 
115 120 125 



Leu Ala Asp He Asp Thr He Asn Leu Glu Leu He Leu Ala Asp Leu 

20. 130 135 140 

Glu Ser Val Asn Lys Arg Tyr Ala Arg Val Glu Lys Met Ala Arg Thr 
145 150 155 160 

25 Gin Lys Asp Lys Glu Ser Val Ala Glu Phe Asn Val Leu Gin Lys He 

165 170 175 



30 



35 



40 



45 



50 



55 



Lys Pro Val Leu Glu Asp Gly Lys Ser Ala Arg Thr He Glu Phe Thr 
180 185 ' 190 

Asp Glu Glu Gin Lys Val Val Lys Gly Leu Phe Leu Leu Thr Thr Lys 
195 200 205 

Pro Val Leu Tyr Val Ala Asn Val Asp Glu Asp Val Val Ser Glu Pro 
210 215 220 

Asp Ser He Asp Tyr Val Lys Gin He Arg Glu Phe Ala Ala Thr Glu 
225 230 . 235 240 

Asn Ala Glu Val Val Val He Ser Ala Arg Ala Glu Glu Glu He Ser 
245 250 255 

Glu- Leu Asp Asp Glu Asp Lys Lys Glu Phe Leu Glu Ala He Gly Leu 
260 265 270 

Thr Glu Ser Gly Val Asp Lys Leu Thr Arg Ala Ala' Tyr His Leu Leu 
275 280 285 

Gly Leu Gly Thr Tyr Phe Thr Ala Gly Glu Lys Glu Val Arg Ala Trp 
290 295 300 

Thr Phe Lys Arg Gly Met Lys Ala Pro Gin Ala Ala Gly He He His 
305 310 315 . 320 

Ser Asp Phe Glu Lys Gly Phe He Arg Ala Val Thr Met Ser Tyr Glu 
325 330 335 
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10 



15 



Asp Leu Val Lys Tyr Gly Ser Glu Lys Ala Val Lys Glu Ala Gly Arg 
340. 345 350 

Leu Arg Glu Glu Gly Lys Glu Tyr He Val Gin Asp Gly * Asp He Met 
355 360 365 

Glu Phe Arg Phe Asn Val 
370 



<210> 159 
<211> 110 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 159 

Met- Glu He Glu Lys Thr Asn Arg Met Asn Ala Leu Phe Glu Phe Tyr 
1 5 10 15 



20 Ala Ala Leu Leu Thr Asp Lys Gin Met Asn Tyr He Glu Leu Tyr Tyr 
20 25 30 



25 



Ala Asp Asp Tyr Ser Leu Ala Glu He Ala Glu Glu Phe Gly Val Ser 
.35 40 45 

Arg Gin Ala Val Tyr Asp Asn He Lys Arg Thr Glu Lys lie Leu Glu 
50 55 60 



Asp Tyr Glu Met Lys Leu His Met Tyr Ser Asp Tyr He Val Arg Ser 
30 65 70 75 80 

Gin He Phe Asp Gin He Leu Glu Arg Tyr Pro Lys Asp Asp Phe Leu 
85 90 95 

35 Gin Glu Gib He Glu He Leu Thr Ser He Asp Asn Arg Glu 
■ 100 105 110 



<210> 160 
40 <211> 223 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 160 

45 Met Thr Leu Glu Trp Glu Glu Phe Leu Asp Pro Tyr He Gin Ala Val 
15 10 15 

Gly Glu Leu Lys He Lys Leu Arg Gly lie Arg Lys Gin Tyr Arg Lys 
20 25 30 

50 

Gin Asn Lys His Ser Pro He Glu Phe Val Thr Gly Arg Val Lys Pro 
35 40 45 

He Glu Ser lie Lys Glu Lys Met Ala Arg Arg Gly He Thr Tyr Ala 
55 50 . 55 60 

Thr Leu Glu His Asp Leu Gin Asp lie Ala Gly Leu Arg Val Met Val 
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65 70 75 80 

Gin Phe Val Asp Asp Val Lys Glu Val Val Asp lie Leu His Lys Arg 
85 90 95 

5 

Gin Asp Met Arg He He Gin Glu Arg Asp Tyr He Thr His Arg Lys 
100 105 110 

Ala Ser Gly Tyr Arg Ser Tyr His Val Val Val Glu Tyr Thr Val Asp 
10 115 12*0 125 

Thr He Asn Gly Ala Lys Thr He Leu Ala Glu lie Gin He Arg Thr 
130 135 140 

15 Leu Ala Met Asn Phe Trp Ala Thr He Glu His Ser Leu Asn Tyr Lys 
145 150 155 160 

Tyr Gin Gly Asp Phe Pro Asp Glu He Lys Lys Arg Leu Glu He Thr 
165. 170 175 

20 

Ala Arg He Ala His Gin Leu Asp Glu Glu Met Gly Glu He Arg Asp 
180 185 190 

Asp He Gin Glu Ala Gin Ala Leu Phe Asp Pro Leu Ser Arg Lys Leu 
25 195 200 205 

Asn Asp Gly Val Gly Asn Ser Asp Asp Thr Asp Glu Glu Tyr Arg 
210 215 " 220 

30 

<210> 161 
<211> 195 
<212> PRT- 

<213> Streptococcus pneumoniae 

35 

<400> 161 

Met Glu Leu Asn Thr His Asn Ala Glu He Leu Leu Ser Ala Ala Asn 
15 10 15 

40 Lys Ser His Tyr Pro Gin Asp Glu Leu Pro Glu He Ala Leu Ala Gly 
20 25 30 

Arg Ser Asn Val Gly Lys Ser Ser Phe He Asn Thr Met Leu Asn Arg 
35 40 45 

.45 

Lys Asn Leu Ala Arg Thr Ser Gly Lys Pro Gly Lys Thr Gin Leu Leu 
50 55 60 

Asn Phe Phe Asn He Asp Asp Lys Met Arg Phe Val Asp Val Pro Gly 
50 65 70 75 80 

Tyr Gly Tyr Ala Arg Val Ser Lys Lys Glu Arg Glu Lys Trp Gly Cys 
85 90 95 

55 Met He Glu Glu Tyr Leu Thr Thr Arg Glu Asn Leu Arg Ala Val Val 
. 100 105 HO 
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Ser Leu Val Asp Leu Arg His Asp Pro Ser Ala Asp Asp Val Gin Met 
115 120 125 

Tyr Glu Phe Leu Lys Tyr Tyr Glu lie Pro Val lie lie Val Ala Thr 
5 130 135 140 

Lys Ala Asp Lys lie Pro Arg Gly Lys Trp Asn Lys His Glu Ser Ala 
145 150 155 160 

10 lie Lys Lys Lys Leu Asn Phe Asp Pro Ser Asp Asp Phe lie Leu Phe 

165 170 175 



15 



40 



45 



Ser Ser Val Ser Lys Ala Gly Met Asp Glu Ala Trp Asp Ala lie Leu 
180 185 190 

Glu Lys Leu 
195 



20 <210> 162 
<211> 97 
<212> PRT 

<213> Streptococcus pneumoniae 
25 <400> 162 

Met Lys Thr Arg Lys lie Pro Leu Arg Lys Ser Val Val Ser Asn Glu 
15 10 15 

Val lie Asp Lys Arg Asp Leu Leu Arg lie Val Lys Asn Lys Glu Gly 
30 20 25 30 

Gin Val Phe lie Asp Pro Thr Gly Lys Ala Asn Gly Arg Gly Ala Tyr 
35 40 45 

35 He Lys Leu Asp Asn Ala Glu Ala Leu Glu Ala Lys Lys Lys Lys Val 
50 55 60 



Phe Asn Arg Ser Phe Ser Met Glu Val Glu Glu Ser Phe Tyr Asp Glu 
65 70 75 80 

Leu He Ala Tyr Val Asp His Lys Val Lys Arg Arg Glu Leu Gly Leu 
85 90 95 

Glu 



Cl* <210> 163 

<211> 103 
50 <212> PRT 

<213> Streptococcus pneumoniae 

<400fcr'163 

• Met Leu Lys Pro Ser He Asp Thr Leu Leu Asp Lys Val Pro Ser Lys 
55 1 5 10 15 

Tyr Ser Leu Val He Leu Glu Ala Lys Arg Ala His Glu Leu Glu Ala 
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20 



25 



30 



Gly Ala Pro Ala Thr Gin Gly Phe Lys Ser Glu Lys Ser Thr Leu Arg 
35 40 45 



Ala Leu Glu Glu He Glu Ser Gly Asn Val Thr He His Pro Asp Pro 
50 55 60 



Glu Gly Lys Arg Glu Ala Val Arg Arg Arg He Glu Glu Glu Lys Arg 

10 65 70 75 80 

Arg Lys Glu Glu Glu Glu Lys Lys He Lys Glu Gin He Ala Lys Glu 

85 90 95 



15 Lys Glu Asp Gly Glu Lys He 
100 



<210> 164 
20 <211> 103 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 164 

25 Met Ser Leu Thr Ser Lys Gin Arg Ala Phe Leu Asn Ser Gin Ala His 
1 5 10 15 

Thr Leu Lys Pro He He Gin He Gly Lys Asn Gly Leu Asn Asp Gin 
20 25 30 

30 . 

He Lys Thr Ser Val Arg Gin Ala Leu Asp Ala Arg Glu Leu He Lys 
35 40 45 

Val Thr Leu Leu Gin Asn Thr Asp Glu Asn He His Glu Val Ala Glu 
35 50 55 - 60 

He Leu Glu Glu Glu lie Gly Val Asp Thr Val Gin Lys He Gly Arg 
65 70 75 80 

40 He Leu He Leu Phe Lys Gin Ser Ser Lys Lys Glu Asn Arg Lys He 

85 90 95 



Ser Lys Lys Val Lys Glu He 
100 

45 

<210> 165 •- 
<211> 175 
<212> PRT 
50 <213> Streptococcus pneumoniae 

<400> 165 

Met Ala He Glu Asn Tyr He Pro Asp Phe Ala Val Glu Ala Val Tyr 
15 10 15 

55 

Asp Leu Thr Val Pro Ser Leu Gin Ala Gin Gly He Lys Ala Val Leu 
20 25 30 
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Val Asp Leu Asp Asn Thr Leu lie Ala Trp Asn Asn Pro Asp Gly Thr 
35 40 45 

5 Pro Glu Met Lys Gin Trp Leu His Asp Leu Arg Asp Ala Gly He Gly 
50 55 , 60 

He He Val Val Ser Asn Asn Thr Lys Lys . Arg Val Gin Arg Ala Val 
65 70 75 80 

10 

Glu Lys Phe Gly He Asp Tyr Val Tyr Trp Ala Leu Lys Pro Phe Thr 
85 90 95 

Phe Gly He Asp Arg Ala Met Lys Glu Phe His Tyr Asp Lys Lys Glu 
15 100 105 110 

Val Val Met Val Gly Asp Gin Leu Met Thr Asp lie Arg Ala Ala His 
115 • 120 125 

20 Arg Ala Gly He Arg Ser He Leu Val Lys Pro Leu Val Gin His Asp 
130 135 140 

Ser He Lys Thr Gin He Asn Arg Thr Arg Glu Arg Arg Val Met Arg 
145 150 155 160- 

25 

Lys He Thr Glu Lys Tyr Gly Pro He Thr Tyr Lys Lys Gly He 
165 170 175 



30 <210> 166 
<211> 455 
<212> PRT 

<213> Streptococcus pneumoniae 
35 <400> 166 

Met Phe Arg Lys He Leu He Ala Asn Arg Gly Glu He Ala Val Arg 
15 10 15 

He He Arg Ala Ala Arg Glu Leu Gly He Ala Thr Val Ala Val Tyr 
40 20 25 30 

Ser Thr Ala Asp Lys Glu Ala Leu His Thr Leu Leu Ala Asp Glu Ala 
• 35 '40 45 

45 Val Cys He Gly Pro Gly Lys Ala Thr Glu Ser Tyr Leu Asn He Asn 
50 55 60 

Ala Val Leu Ser Ala Ala Val Leu Thr Glu Ala Glu Ala He His Pro 
65 70 75 80 

50 

Gly* Phe Gly Phe Leu Ser Glu Asn Ser Lys Phe Ala Thr Met Cys Glu 
85 90 95 

Glu lie- Gly He Lys Phe He Gly Pro Ser Gly His Val Met Asp Met 
55 100 105 HO 

Met Gly Asp Lys He Asn Ala Arg Ala Gin Met He Lys Ala Gly Val 
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115 



120 



125 



Pro Val He Pro Gly Ser Asp Gly Glu Val His Asn Ser Glu Glu Ala 
130 135 140. 

Leu He Val Ala Glu Lys He Gly Tyr Pro Val Met Leu Lys Ala Ser 
145 150 155 160 

Ala Gly Gly Gly Gly Lys Gly He Arg Lys Val Glu Lys Pro Asp Asp 
165 170 175 

Leu Val Ser Ala Phe Glu Thr Ala Ser Ser Glu Ala Lys Ala Asn Tyr 
180 185 190 

Gly Asn Gly Ala Met Tyr He Glu Arg Val He Tyr Pro Ala Arg His 
195 200 205 

He Glu Val Gin He Leu Gly Asp Glu His Gly His Val He His Leu 
210 215 220 

Gly Glu Arg Asp Cys Ser Leu Gin Arg Asn Asn Gin Lys Val Leu Glu 
225 230 ■ 235 * 240 

Glu Ser Pro Ser He Ala He Gly Lys Thr Leu Arg His Glu He Gly 
245 250 255 

Ala Ala Ala Val Arg Ala Ala Glu Phe Val Gly Tyr Glu Asn Ala Gly 
260 265 270 

Thr He Glu Phe Leu Leu Asp Glu Ala Ser Ser Asn Phe Tyr Phe Met 
275 280 . 285 

Glu Met Asn Thr Arg Val Gin Val Glu His Pro Val Thr Glu Phe Val 
290 295 300 

Ser Gly Val Asp He Val Lys Glu Gin He Cys He Ala Ala Gly Gin 
305 310 315 320 

Pro Leu Ser Val Lys Gin Glu Asp He Val Leu Arg Gly His Ala He 
325 330 335 

Glu Cys Arg He Asn Ala Glu Asn Pro Ala Phe Asn Phe Ala Pro Ser 
340 345 350 

Pro Gly Lys He Thr Asn Leu Tyr Leu Pro Ser Gly Gly Val Gly Leu 
355 360 365 

Arg Val Asp Ser Ala Val Tyr Pro Gly Tyr Thr .He Pro Pro Tyr Tyr 
370 375 380 

Asp Ser Met He Ala Lys He He Val His Gly Glu Asn Arg Phe Asp 
385 390 395 400 * 

Ala Leu Met Lys Met Gin Arg Ala Leu Tyr Glu Leu Glu He Glu Gly 
405 410 ' 415 



Val 'Gin Thr Asn Ala Asp Phe Gin Leu Asp Leu He Ser Asp Arg Asn 
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420 



425 



430 



Val lie Ala Gly Asp Tyr Asp Thr Cys Phe Leu Met Glu Thr Phe Leu 
435 440 445 



Pro Lys Tyr Gin Glu Lys Glu 
450 455 



<210> 167 
<211> 77 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 167 

Met lie Tyr Lys Val Phe Tyr Gin 
1 5 

Arg Glu Thr Thr Arg Ala Leu Tyr 
20 

Leu Glu Gly Arg lie Thr Ala Arg 
35 40 

Glu Tyr Asn lie Glu Tyr He Glu 
50 55 

Tyr Glu Lys Glu Thr Gly Ala Phe 
65 70 



Glu Thr Lys Glu Arg Ser. Pro Arg 
10 15 

Leu Asp He Asp Thr Ser Ser Glu 
25 30 

Gin Leu Val Glu Glu Asn Arg Pro 
45 

Leu Leu Ser Asp Lys Leu Leu Asp 
60 

Glu He Thr Glu Phe 
75 



<210> 168 
<211> 336 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 168 

Met Lys Asp Arg Tyr lie Leu Ala Phe Glu Thr Ser Cys Asp Glu Thr 
1 .'5 . 10 15 

Ser Val Ala Val Leu Lys Asn Asp Asp Glu Leu Leu Ser Asn Val He 
20 25 30 

Ala Ser Gin He Glu* Ser His Lys Arg Phe Gly Gly Val Val Pro Glu 
35 40 45 

Val Ala Ser Arg His His Val Glu Val He Thr Ala Cys He Glu Glu 

50 55 '■; • 60 

Ala Leu Ala Glu Ala Gly He Thr Glu Glu Asp Val Thr Ala Val Ala 
65 70 75 80 

Val Thr Tyr Gly Pro Gly Leu Val Gly Ala Leu Leu Val Gly Leu Ser 
85 90 95 

Ala Ala Lys Ala Phe Ala Trp Ala His Gly Leu Pro Leu He Pro Val 
100 105 110 
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Asn His Met Ala 
115 

Glu Phe Pro Leu 
130 



Gly His Leu Met 
120 

Leu Ala Leu Leu 
135 



Ala Ala Gin Ser 

Val Ser Gly Gly 
140 



Val Glu Pro Leu 
125 

His Thr Glu Leu 



Val Tyr Val Ser Glu Ala Gly Asp 
145 150 

Asp Asp Ala Val Gly Glu Ala Tyr 
165 

Leu Thr Tyr Pro Ala Gly Arg Giu 
180 

Gin Asp lie Tyr Asp Phe Pro Arg 
195 200 

Glu Phe Ser Phe Ser Gly Leu Lys 
210 215 



Tyr Lys lie Val Gly Glu Thr Arg 
155 1 160 

Asp Lys Val Gly Arg Val Met Gly 
170 175 

He Asp Glu Leu Ala His Gin Gly 
185 190 • 

Ala Met He Lys Glu Asp Asn Leu 
205 

Ser Ala Phe lie Asn Leu His His 
220 



Asn Ala Glu Gin Lys Gly Glu Ser Leu Ser Thr Glu Asp Leu Cys Ala 
225 230 . 235 240 

Ser Phe Gin Ala Ala Val Met Asp He "Leu Met Ala Lys Thr Lys Lys 
245 250 255 



Ala Leu Glu Glu Tyr Pro Val Lys Thr Leu Phe Val Ala Gly Gly Val 
260 265 270 

Ala Ala Asn Lys Gly Leu Arg Glu Arg Leu Ala Ala Glu He Thr Asp 
275 280 285 

Val Lys Val He He Pro Pro Leu Arg Leu Cys Gly Asp Asn Ala Gly 
■290 295 300 



Met He Ala Tyr Ala Ser Val Ser Glu Trp. Asn Lys Glu Asn Phe Ala 

305 310 315 320 

Gly Trp Asp Leu Asn. Ala Lys Pro Ser Leu Ala Phe Asp Thr Met Glu 

325 330 335 



<210> 169 
<211> 602 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 169 

Met Cys Gly He Val Gly Val Val Gly Asn Thr Asn Ala Thr Asp lie 
1 5 . 10 15 

Leu He Gin Gly Leu Glu Lys Leu Glu Tyr Arg Gly Tyr Asp Ser Ala 
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20 



25 



30 



Gly He Phe Val Leu Asp Gly Ala Asp Asn His Leu Val Lys Ala Val 
35 40 45 

Gly Arg He Ala Glu Leu Ser Ala Lys Thr Ala Gly Val Glu Gly Thr 
50 55 60 

Thr Gly He Gly His- Thr Arg Trp Ala Thr His Gly Lys Pro Thr Glu 
65 70 75 80 

Asp Asn Ala His Pro His Arg Ser Glu Thr Glu Arg Phe Val Leu Val 
85 90 95 

His Asn Gly Val He Glu Asn Tyr Leu Glu He Lys Glu Glu Tyr Leu 
100 105 110 

Ala Gly His His Phe Lys Gly Gin Thr Asp Thr Glu He Ala Val His 
115 120 125 

Leu lie Gly Lys Phe Ala Glu Glu Glu Gly Leu Ser Val Leu Glu Ala 
• 130 135 140 

Phe Lys Lys Ala Leu His He He Arg Gly Ser Tyr Ala Phe Ala Leu 
145 150 155 160 

He Asp Ser Glu Asn Pro Asp Val He Tyr Val Ala Lys Asn Lys Ser 
165 170 175 

Pro Leu Leu He Gly Leu Gly Glu Gly Tyr Asn Met Val Cys Ser Asp 
180 185 190 

Ala Met Ala Met He Arg Glu Thr Asn Gin Tyr Met Glu lie His Asp 
195 200 205 

Gin Glu Leu Val He Val Lys Ala Asp' Ser Val Glu Val Gin Asp Tyr 
210 215 220 

Asp Gly Asn Ser Arg Glu Arg Ala Ser Tyr Thr Ala Glu Leu Asp Leu 
225 230 235 240.. 

Ser Asp lie Gly Lys Gly Thr Tyr Pro Tyr Tyr Met Leu Lys Glu lie 
245 250 • 255 

Asp Glu Gin Pro Thr Val Met Arg Lys Leu lie Gin Ala Tyr Thr Asp 
260 265 270 

Asp Ala Gly Gin Val Val Val Ala Pro Ala lie lie Lys Ala Val Gin 
275 280 285 

Asp Ala Asp Arg He Tyr lie Leu Ala Ala Gly Thr Ser Tyr His Ala 
290 295 300 

Gly Phe-Ala Ser Lys Lys Met Leu Glu Glu Leu Thr Asp Thr Pro Val 
305 310 315 . 320 



Glu Leu Gly lie Ser Ser Glu Trp Gly Tyr Gly Met Pro Leu Leu Ser 
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325 

Lys Lys Pro Leu Phe lie Phe 
340 

5 

Ser Arg Gin Val Leu Val Lys 
355 

Thr Val Thr Asn Val Pro Gly 
10 370 375 

Thr Met Leu Leu His Ala Gly 
385 390 

15 Ala Tyr Thr Ala Gin He Ala 

405 



330 335 

He Ser Gin Ser Gly Glu Thr Ala Asp 
345 350 

Ala Asn Glu Met Gly He Pro Ser Leu 
360 365 * 

Ser Thr Leu Ser Arg Glu Ala Asn Tyr 
380 

Pro Glu He Ala Val Ala Ser Thr Lys 

395 400 

Ala Leu Ala Phe Leu Ala Lys Ala Val 
410 415 



20 



Gly Glu Ala Asn Gly Asn Ala Lys Ala Gin Ala Phe Asp Leu Val His 

420 425 430 

Glu Leu Ser He Val Ala Gin Ser He Glu Ser Thr Leu Ser Glu Lys 

435 440 445 



Glu Thr He Glu Ala Lys Val Arg Glu Leu Leu Glu Thr Thr Arg Asn 
25 450 " 455 460 

Ala Phe Tyr He Gly Arg Gly Gin Asp Tyr Tyr Val Ala Met Glu Ala 

465 470 475 480 

30 Ser Leu Lys Leu Lys Glu He Ser Tyr He Gin Cys Glu Gly Phe Ala 

485 490 495 



35 



Ala Gly Glu Leu Lys His Gly Thr He Ala Leu He Glu Glu Gly Thr 

500 505 510 

Pro Val Leu Ala Leu Leu Ser Asp Pro Val Leu Ala Asn His Thr Arg 

515 520 525 



Gly Asn He Gin Glu Val Ala Ala Arg Gly Ala Lys Val Leu Thr He 
40 530 535 540 

Ala Glu Glu Asn Val Ala Lys Asp Thr Asp Asp He Val Leu Thr Thr 

545 550 * 555 560 

45 Val His Pro Tyr Leu Ser Pro He Ser Met Val Val Pro Thr Gin Leu 

565 570 575 



50 



Val Ala Tyr Phe Ala Thr Leu His Arg Gly Leu Asp Val Asp Lys Pro 
580 58.5 590 

Arg Asn Leu Ala Lys Ser Val Thr Val Glu 
595 600 



55 <210> 170 
<211> 240 
<212> PRT 
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<213> Streptococcus pneumoniae 
<400> 170 

Met lie Arg lie Glu Asn Leu Ser Val Ser Tyr Lys Glu Thr Leu Ala 
5 1 5 10 15 

Leu Lys Asp lie Ser Leu Val Leu His Gly Pro Thr lie Thr Gly lie 
20 25 30 

10 lie Gly Pro Asn Gly Ala Gly Lys Ser Thr Leu Leu Lys Gly Met Leu 
35 40 45 



15 



30 



45 



50 



Gly lie He Pro His Gin Gly Gin Ala Phe Leu Asp Asp Lys Glu Val 

50 55 60 

Lys Lys Ser Leu His Arg He Ala Tyr Val Glu Gin Lys He Asn He 

65 70 75 80 



Asp Tyr Asn Phe Pro He Lys Val Lys Glu Cys Val Ser Leu Gly Leu 
20 85 90 95 

Phe Pro Ser He Pro Leu Phe Arg Ser Leu Lys Ala Lys His Trp Lys 
100 105 110 

25 Lys Val Gin Glu Ala Leu Glu He Val Gly Leu Ala Asp Tyr Ala Glu 
115 120 125 



Arg Gin He Ser Gin Leu Ser Gly Gly Gin Phe Gin Arg Val Leu He 
130 135 140 

"Ala Arg Cys Leu Val Gin Glu Ala Asp Tyr He Leu Leu Asp Glu Pro 

145 150 155 160 



Phe Ala Gly lie Asp Ser Val Ser Glu Glu He He Met Asn Thr Leu 
35 165 170 175 

Arg Asp Leu Lys Lys Ala Gly Lys Thr Val Leu He Val His His Asp 
180 185 190 

40 Leu Ser Lys He Pro His Tyr Phe Asp Gin Val Leu Leu Val Asn Arg 
195 200 205 



Glu Val He Ala Phe Gly Pro Thr Lys Glu Thr Phe Thr Glu Thr Asn 
210. 215 220 

Leu Lys Glu Ala Tyr Gly Asn Gin Leu Phe Phe Asn Gly Gly Asp Leu 
225 230 - 235 240 



<210> 171 
<211> 740 
55 <212> PRT 

<213> Streptococcus 



pneumoniae 
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<400> 171 

Met Pro Lys Glu Val Asn Leu Thr Gly Glu Glu Val Val Ala Leu Thr 
15 10 15 

Lys Glu Tyr Leu Thr Glu Glu Asp Val His Phe Val His Lys Ala Leu 
20 2.5 30 

Val Tyr Ala Val Glu Cys His Ser Gly Gin Tyr Arg Lys Ser Gly Glu 
35 40 45 

Pro Tyr lie lie His Pro lie Gin Val Ala Gly He Leu Ala Lys Leu 
50-55 60 

Lys Leu Asp Ala Val Thr Val Ala Cys Gly Phe Leu His Asp Val Val 
65 70 75 80 

Glu Asp Thr Asp Ala Thr Leu Asp Asp Leu Glu Arg Glu Phe Gly Pro 
85 90 95 

Asp Val Arg Val He Val Asp Gly Val Thr Lys Leu Gly Lys Val Glu 
100 105 110 

Tyr Lys Ser He Glu Glu Gin Leu Ala Glu Asn His Arg Lys Met Leu 
115 120 125 

Met Ala Met Ser Glu Asp- He Arg Val lie Leu Val Lys Leu Ser Asp 
130 135 140 

Arg Leu His Asn Met Arg Thr Leu Lys His Leu Arg Lys Asp Lys Gin 
145 150 155 160 

Glu Arg He Ser Lys Glu Thr Met Glu He Tyr Ala Pro Leu Ala His 
165 170 175 

Arg Leu Gly He Ser Ser Val Lys Trp Glu Leu Glu Asp Leu Ser Phe 
180 185 190 

Arg Tyr Leu Asn Pro Thr Glu Phe Tyr Lys He Thr His Met Met Lys 
195 200 205 

Glu Lys Arg Arg Glu Arg Glu Ala Leu Val Asp Glu Val Val Thr Lys 
210 215 220 

Leu Glu Glu Tyr Thr Thr Glu Arg His Leu Lys Gly Lys He Tyr Gly 
225 230 235 240 

Arg Pro Lys His He Tyr Ser He Phe -Arg Lys Met Gin Asp Lys Arg 
245 250 255 

Lys Arg Phe Glu Glu He Tyr Asp Leu He Ala He Arg Cys He Leu 
260 265 270 

Asp Thr Gin Ser Asp Val Tyr Ala Met Leu Gly Tyr Val His Glu Phe 
275 280 285 

Trp Lys Pro Met Pro Gly Arg Phe Lys Asp Tyr He Ala Asn Arg Lys 
290 295 300 
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Ala Asn Gly Tyr Gin Ser lie His 
305 310 

5 Pro lie Glu Phe Gin lie Arg Thr 

325 

Tyr Gly Val Ala Ala His Trp Ala 
340 

10 

Val Asn Ser Lys Glu Ser Ala lie 
355 360 



Thr Thr Val Tyr Gly Pro Lys Gly 
315 320 

Lys Glu Met His Glu Val Ala Glu 
330 335 

Tyr Lys Lys Gly lie Lys Gly Gin 
345 350 

Gly Met Asn Trp lie Lys Glu Met 
365 



Met Glu Leu Gin Asp Gin Ala Asp Asp Ala Lys .Glu Phe Val Asp Ser 

15 370' 375 380 

Val Lys Glu Asn Tyr Leu Ala Glu Glu lie Tyr Val Phe Thr Pro Asp 

385 390 395 400 

20 Gly Ala. Val Arg Ser Leu Pro Lys Asp Ser Gly Pro lie Asp Phe Ala 

405 410 . 415 



25 



Tyr Glu lie His Thr Lys Val Gly Glu Lys Ala Thr Gly Ala Lys Val 

420 425 430 

Asn Gly Arg Met Val Pro Leu Thr Thr Lys Leu Lys Thr Gly Asp Gin 

435 440 445 



Val Glu lie lie Ala Asn Pro Asn Ser Phe Gly Pro Ser Arg Asp Trp 
30 450 455 460 

Leu Asn Met Val Lys Thr Ser Lys Ala Arg Asn Lys lie Arg Gin Phe 
465 470 475 480 

35 Phe Lys Asn Gin Asp Lys Glu Leu Ser Val Asn Lys Gly Arg Glu Met 

485 490 495 



40 



Leu Met Ala Gin Phe Gin Glu Asn Gly Tyr Val Ala Asn Lys Phe Met 
500 505 510 

Asp Lys Arg His Met Asp Gin Val Leu Gin Lys Thr Ser Tyr Lys Thr 
515 520 525 



Glu Asp Ser Leu Phe Ala Ala lie Gly Phe Gly Glu lie Gly Ala He 
45 530 . 535 540 

Thr Val Phe Asn Arg Leu Thr Glu Lys Glu Arg Arg Glu Glu Glu Arg 

545 , 550 ■ - - 555 560 

50 Ala Lys Ala Lys Ala Glu Ala Glu Glu Leu Val Lys Gly Gly Glu Val 

565 570 575 



55 



Lys Val Glu Asn Lys Glu Thr Leu Lys Val Lys His Glu Gly Gly Val 
580 585 590 

Val He Glu Gly Ala Ser Gly Leu Leu Val Arg He Ala Lys Cys Cys 

595 600 605 
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Asn Pro Val Pro Gly Asp Asp lie Val Gly Tyr lie Thr Lys Gly Arg 
610 615 620 

Gly Val Ala lie His Arg Val Asp Cys Met Asn Leu Arg Ala Gin Glu 
625 -630 635 640 

Asn Tyr Glu Gin Arg Leu Leu Asp Val Glu Trp Glu Asp Gin Tyr Ser 
645 650 655 

Ser Ser Asn Lys Glu Tyr Leu Ala His lie Asp lie Tyr Gly Leu Asn 
660 665 670 

Arg Thr Gly Leu Leu Asn Asp Val Leu Gin Val Leu Ser Asn Thr Thr 
675 680 685 

Lys Asn lie Ser Thr Val Asn Ala Gin Pro Thr Lys Asp Met Lys Phe 
690 695 700 

Ala Asn lie His Val Ser Phe Gly lie Ala Asn Leu Ser Thr Leu Thr 
705 710 715 720 

Thr Val Val Asp Lys lie Lys Ser Val Pro Glu Val Tyr Ser Val Lys 
725 730 735 



Arg Thr Asn Gly 
740 



<210> 172 
<211> 492 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 172 

Met Ser Asn Trp Asp Thr Lys Phe Leu Lys Lys Gly Phe Thr Phe Asp " 
1 5 .10 15 

Asp Val Leu Leu lie Pro Ala Glu Ser His Val Leu Pro Asn- Asp Ala 
20 25 30 

Asp Leu Thr Thr Lys Leu Ala Asp Asn Leu Thr Leu Asn He Pro He 
35 40 45 

He Thr Ala Ala Met Asp Thr Val Thr Glu Ser Gin Met Ala He Ala 
50 .55 60 

lie Ala Arg Ala Gly Gly Leu Gly Val He His Lys Asn Met Ser He 
65 70 75 80 

Ala Gin Gin Ala Asp Glu Val - Arg Lys Val Lys Arg Ser Glu Asn Gly 
85 90 95 * 

Val He He Asp Pro Phe Phe Leu Thr Pro Glu His Thr He Ala Glu 
100 105 110 

Ala "Asp Glu Leu Met Gly Arg Tyr Arg He Ser Gly Val Pro Val Val 
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115 



120 



125 



Glu Thr Leu Glu Asn Arg Lys Leu Val Gly lie Leu Thr Asn Arg Asp 
130 " 135 140 

5 

Leu Arg Phe He Ser Asp Tyr Asn Gin Pro He Ser Asn His Met Thr 
145 150 155 160 

Ser Glu Asn Leu Val Thr Ala Pro Val Gly Thr Asp Leu Ala Thr Ala 
10 165 170 175 

Glu Ser He Leu Gin Glu His Arg He .Glu Lys Leu Pro Leu Val Asp 
180 185 190 

15 Glu Glu Gly Ser Leu Ser Gly Leu He Thr He Lys Asp He Glu Lys 
195 200 205 



20 



Val He Glu Phe Pro Asn Ala Ala Lys Asp Glu Phe Gly Arg Leu Leu 

210 215 - 220 

Val Ala Gly Ala Val Gly Val Thr Ser Asp Thr Phe Glu Arg Ala Glu 

225 230 235 240 



Ala Leu Phe Glu Ala Gly Ala Asp Ala lie Val He Asp Thr Ala His 
25 245 250 255 

Gly His Ser Ala Gly Val Leu Arg Lys He Ala Glu He Arg Ala His 
260 265 270. 

30 Phe Pro Asp Arg Thr Leu lie Ala Gly Asn lie Ala Thr Ala Glu Gly 
275 280 285 



35 



Ala Arg Ala Leu Tyr Glu Ala Gly Val Asp Val Val Lys Val Gly lie 

290 295 300 

Gly Pro Gly Ser He Cys Thr Thr Arg Val He Ala Gly Val Gly Val 

305 310 315 * 320 



Pro Gin Val Thr Ala lie Tyr Asp Ala Ala Ala Val Ala Arg Glu Tyr 
40 325 330 335 

Gly Lys Thr lie lie Ala Asp Gly Gly lie Lys Tyr Ser Gly Asp lie 
340 345 350 

45 Val Lys Ala Leu Ala Ala Gly Gly Asn Ala Val Met Leu Gly Ser Met 
355 360 365 



50 



Phe Ala Gly Thr Asp Glu Ala Pro Gly Glu Thr Glu He Phe Gin Gly 
370 375 380 

Arg Lys Phe Lys Thr Tyr Arg Gly Met Gly Ser lie Ala Ala Met Lys 

385 390 395, 400 



Lys Gly Ser Ser Asp Arg Tyr Phe Gin Gly Ser Val Asn Glu Ala Asn 
55 405 410 415 

Lys Leu Val Pro Glu Gly lie Glu Gly Arg Val Ala Tyr Lys Gly Ala 
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420 



425 



430 



Ala Ala Asp He Val Phe Gin Met He Gly Gly lie Arg Ser Gly Met 

435 440 • 445 

Gly Tyr Cys Gly Ala Ala Asn Leu Lys Glu Leu His Asp Asn Ala Gin 
450 455 460 

Phe He Glu Met Ser Gly Ala Gly Leu Lys Glu Ser His Pro His Asp 

465 470 475 480 



Val Gin He Thr Asn Glu Ala Pro Asn Tyr Ser Met 
485 490 



<210> 173 
<211> 648 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 173 

Met Thr Glu Glu He Lys Asn Leu Gin Ala Gin Asp Tyr Asp Ala Ser 
15 10 15 

Gin He Gin Val Leu Glu Gly Leu Glu Ala Val Arg Met Arg Pro Gly 
20 25 30 

Met Tyr He Gly Ser Thr Ser Lys Glu Gly Leu His His Leu Val Trp 
35 40 4 5 

Glu He Val Asp Asn Ser He Asp Glu Ala Leu Ala Gly Phe Ala Ser 
50 55 60 

His He Gin Val Phe He Glu Pro Asp Asp Ser He Thr Val Val Asp 
65 70 75 80 

Asp Gly Arg Gly He Pro Val Asp He Gin Glu Lys Thr Gly Arg Pro 
85 90 95 

Ala Val Glu Thr Val Phe Thr Val Leu His Ala Gly Gly Lys Phe Gly 
100 105 110 

Gly Gly Gly Tyr Lys Val Ser Gly Gly Leu His Gly Val Gly Ser Ser 
115 120 125 

Val Val Asn Ala Leu Ser Thr Gin Leu Asp Val His Val His Lys Asn 
130 135 140 

Gly Lys He His Tyr Gin Glu Tyr Arg Arg Gly His Val Val Ala Asp 
145 150 155 . 160 

Leu Glu lie Val Gly Asp Thr Asp Lys Thr Gly Thr Thr Val His Phe 
165 170 175 

Thr Pro Asp Pro Lys He Phe Thr Glu Thr Thr He Phe Asp Phe Asp 
180 185 190* 
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Lys Leu Asn Lys 
195 

Gin He Ser He 
210 

Tyr His Tyr Glu 
225 

Asn Lys Asp Val 



Asp Asp He Thr 
260 

Glu Asn Val Met 
275 

Thr His Glu Gin 
290 

Tyr Ala Arg Lys 
305 

Gly Glu Asp Val 



Pro Asn Pro Gin 
340 

Glu Val Val Lys 
355 

Phe Leu Met Glu 
370 

He. Leu Ala Ala 
385 

Thr Arg Lys Lys 



Ala Asp Cys Ser 
420 

Glu Gly Asp Ser 
435 

Phe Gin Ala He 
450 

Ala Ser Met Asp 
4 65 

Thr Ala Met Gly 



Arg He Gin Glu Leu Ala Phe Leu Asn Arg Gly Leu 
200 205 

Thr Asp Lys Arg Gin Gly Leu Glu Gin Thr Lys His 
215 220 

Gly Gly He Ala Ser Tyr Val Glu Tyr He Asn Glu 
230 235 240 

lie Phe Asp Thr Pro lie Tyr Thr Asp Gly Glu Met 
245 250 255 



Val Glu Val Ala Met Gin 
265 

Ser Phe Ala Asn Asn He 
280 



Phe Glu Gly Gin Thr Lys 
345 

He Thr Asn Arg Leu Phe 
360 ' 



Tyr Thr Thr Gly Tyr His 
270 

His Thr His Glu Gly Gly 
285 

Arg Val He Asn Asp 
300 

Glu Asp Asn Leu Thr 
320 

He Ser Val Lys His 
. 335 

Thr Lys Leu Gly Asn Ser 
350 

Ser Glu Ala Phe Ser Asp 
365 



Gly Phe Arg Thr Ala Leu Thr 
295 

Asn Lys Leu Leu Lys Asp Asn 
310 315 

Arg Glu Gly Leu Thr Ala Val 
325 330 



Asn Pro Gin He Ala Lys Arg He Val Glu Lys Gly 
375 380 

Lys Ala Arg Val Ala Ala Lys Arg Ala Arg Glu Val 
390 395 400 

Ser Gly Leu Glu He Ser Asn Leu Pro Gly Lys Leu 
405 410 415 

Ser Asn Asn Pro Ala Glu Thr Glu Leu Phe He Val 
425 430 

Ala Gly Gly Ser Ala Lys Ser Gly Arg Asn Arg Glu 
440 445 

Leu Pro He Arg Gly Lys He Leu Asn Val Glu Lys 
455 460 • 

Lys He Leu Ala Asn Glu Glu He Arg Ser Leu Phe 
470 475 480 

Thr Gly Phe Gly Ala Glu Phe Asp Val Ser Lys Ala 
485 490 495 
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Arg Tyr Gin Lys Leu Val Leu Met Thr Asp Ala Asp Val Asp Gly Ala 
500 505 510 

His lie Arg Thr Leu Leu Leu Thr Leu lie Tyr Arg Tyr Met Lys Pro 
5 515 520 525 

lie Leu Glu Ala Gly Tyr Val Tyr He Ala Gin Pro Pro He Tyr Gly 
530 535 540 

10 Val Lys Val Gly Ser Glu He Lys Glu Tyr He Gin Pro Gly Ala -Asp 
545 550 555 560 

Gin Glu He Lys Leu Gin Glu Ala Leu Ala Arg Tyr Ser Glu Gly Arg 
565 570 575 

15 

Thr Lys Pro Thr He Gin Arg Tyr Lys Gly Leu Gly Glu Met Asp Asp 
580 585 590 

His Gin Leu Trp Glu Thr Thr Met Asp Pro Glu His Arg Leu Met Ala 
20 595 600 605 

Arg Val Ser Val Asp Asp Ala Ala Glu Ala Asp Lys He Phe Asp . Met 
610 . 615 620 

25 Leu Met Gly Asp Arg Val Glu Pro Arg Arg Glu Phe He Glu Glu Asn 
625 630 635 640 

Ala Val Tyr Ser Thr Leu Asp Val 
645 

30 

<210> 174 
<211> 88 
<212> PRT 
35 <213> Streptococcus pneumoniae 

<400> 174 

Met Gly Phe Thr Glu Glu Thr Val Arg Phe Lys Leu Asp Asp Ser Asn 
15 10 15 

40 

Lys Lys Glu He Ser Glu Thr Leu Thr Asp Val Tyr Ala Ser Leu Asn 
20 25 30 

Asp Lys Gly Tyr Asn Pro He Asn Gin He Val Gly Tyr Val Leu Ser 
45 35 40 45 

Gly Asp Pro Ala Tyr Val Pro Arg* Tyr- Asn Asn Ala Arg Asn Gin He 
50 55 60 

50 Arg Lys Tyr Glu Arg Asp Glu He Val Glu Glu Leu Val Arg Tyr Tyr 
65 70 75 80 

Leu Lys Gly Gin Gly Val Asp Leu 
85 

55 

<210> 175 
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<211> 198 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 175 

Met Val Asn Tyr Pro His Lys Val Ser Ser Gin Asp Arg Gin Thr Ser 
1 5 10 15 

Leu Ser Gin Pro Lys Asn Phe Ala Asn Arg Gly Met Ser Phe Glu Lys 
20 25 30 

Met lie Asn Ala Thr Asn Asp Tyr Tyr Leu Ser Gin Gly Leu Ala Val 
35 40 45 

lie His Lys Lys Pro Thr Pro lie Gin He Val Gin Val Asp Tyr Pro 
50 55 60 

Gin Arg Ser Arg Ala Lys lie Val Glu Ala Tyr Phe Arg Gin Ala Ser 
65 70 75 80 

Thr Thr Asp Tyr Ser Gly Val Tyr Asn Gly Tyr Tyr He Asp Phe Glu 
85 90 95 

Val Lys Glu Thr Lys Gin Lys Arg Ala He Pro Met Lys Asn Phe His 
100 105 110 

Pro His Gin He Gin His Met Glu Gin Val Leu Ala Gin Gin Gly He 
115 120 125 * 

Cys Phe Val Leu Leu His Phe Ser Ser Gin Gin Glu Thr Tyr Leu Leu 
130 135 140 

Pro Ala Phe Asp Leu lie Arg Phe Tyr His Gin Asp Lys Gly Gin Lys 
145 150 155 160 

Ser Met Pro Leu Glu Tyr He Arg Glu Tyr Gly Tyr Glu He Lys Ala 
165' 170" 175 

Gly Ala Phe Pro Gin He Pro Tyr Leu Asn Val He Lys Glu His Leu 
180 185 190 

Leu Gly Gly Lys Thr Arg 
195 



<2'10> 176 

<211> 288 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 176 

Met Ala Leu Phe Ser Lys Lys Asp Lys Tyr lie Arg lie Asn Pro Asn 
. 1 5 10 15 

Arg Ser Val Arg Glu Lys Pro Gin Ala Lys Pro Glu Val Pro Asp Glu 
20 25 30 
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Leu Phe Ser Gin Cys Pro Gly Cys Lys His Thr lie Tyr Gin Lys Asp 
35 40 45 

Leu Gly Ser Glu Arg He Cys Pro His Cys Ser Tyr Thr Phe Arg He 
5 50 55 60 

Ser Ala Gin Glu Arg Leu Ala Leu Thr He Asp Met Gly Thr Phe Lys 
65 70 75 80 

10 Glu Leu Phe Thr Gly He Glu Ser Lys Asp Pro Leu His Phe Pro Gly 

85 90 95 

Tyr Gin Lys Lys Leu Ala Ser Met Arg Glu Lys Thr Gly Leu His Glu 
100 105 110 . 

15 

Ala Val Val Thr Gly Thr Ala Leu He Lys Gly Gin Thr Val Ala Leu 
115 120 125 

Gly He Met Asp Ser Asn Phe He Met Ala Ser Met Gly Thr Val Val 
20 130 135 140 

Gly Glu Lys He Thr Arg Leu Phe Glu Tyr Ala Thr Val Glu Lys Leu 
145 150 155 160 

'25 Pro Val Val Leu Phe Thr Ala Ser Gly Gly Ala Arg Met Gin Glu Gly 

165 170 175 

He Met Ser Leu Met Gin Met Ala Lys He Ser Ala Ala Val Lys Arg 
180 185 190 

30 

- His Ser Asn Ala Gly Leu Phe Tyr Leu Thr He Leu Thr Asp Pro Thr 
195 200 205 

Thr Gly Gly Val Thr Ala Ser Phe Ala Met Glu Gly Asp He He Leu 
35 210 215 220 

Ala Glu Pro Gin Ser Leu Val Gly Phe Ala Gly Arg Arg Val He Glu 
225 230 235 240 

40 Asn Thr Val Arg Glu Ser Leu Pro Glu Asp Phe Gin Lys Ala Glu Phe 

245 250 255 

Leu Leu Glu His Gly Phe Val Asp Ala He Val Lys Arg Arg Asp Leu 
260 265 270 

45 

Pro Asp Thr He Ala Ser Leu Val Arg Leu His Gly Gly Ser Pro Arg 
275 280 285 



50 



<210> 177 
<211> 139 
55 <212> PRT 

<213> Streptococcus pneumoniae 
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<400> 177 

Met Arg He Met Gly Leu Asp Val Gly Ser Lys Thr Val Gly Val Ala 
1 5 10 15 

He Ser Asp Pro Leu Gly Phe Thr Ala "Gin Gly Leu Glu He He Gin 
20 25 30 

He Asn Glu Glu Gin Gly Gin Phe Gly Ser Asp Arg Val Lys Glu Leu 
35 40 45 

Val Asp Thr Tyr Lys Val Glu Arg Phe Val Val Gly Leu Pro Lys Asn 
50 55 60 

Met Asn Asn Thr Ser Gly Pro Arg Val Glu Ala Ser Gin Ala Tyr Gly 
65 70 75 80 

Ala Lys Leu Glu Glu Phe Phe Gly Leu Pro Val Asp Tyr Gin Asp Glu 
85 90 95 

Arg Leu Thr Thr Val Ala Ala Glu Arg Met Leu He Glu Gin Ala Asp 
100 105 110 

He Ser Arg Asn Lys Arg Lys Lys Val He Asp Lys Leu Ala Ala Gin 
115. 120 ( 125 

Leu He Leu Gin Asn Tyr Leu Asp Arg Lys Phe 
130 135 



<210> 178 
- <211> 398 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 178 

Met Ala Lys Leu Thr Val Lys Asp Val Asp Leu Lys Gly Lys Lys Val 
15 10 15 

Leu Val Arg Val Asp Phe Asn Val Pro Leu Lys Asp Gly Val He Thr 
20 25 30 

Asn Asp Asn Arg He Thr Ala Ala Leu Pro Thr He Lys Tyr He He 
35 40 45 

Glu Gin Gly Gly Arg Ala He Leu Phe Ser His Leu Gly Arg Val Lys 
50 55 60 

Glu Glu Ala Asp Lys Ala Gly Lys Ser Leu Ala Pro. Val Ala Ala Asp 
65 70 75 80 

Leu Ala Ala Lys Leu Gly Gin Asp Val Val Phe Pro Gly Val Thr Arg 
85 90 95 

Gly Ala Glu Leu Glu Ala Ala He Asn Ala Leu Glu Asp Gly Gin Val 
100 105 110 

Leu Leu Val Glu Asn Thr Arg Tyr Glu Asp Val Asp Gly Lys Lys Glu 
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115 



120 



125 



Ser Lys Asn Asp Pro Glu Leu Gly Lys Tyr Trp Ala Ser Leu Gly Asp 
130 135 140 

Gly lie Phe Val Asn Asp Ala Phe Gly Thr Ala His Arg Ala His Ala 
145 150 155 160 

Ser Asn Val Gly He Ser Ala Asn Val Glu Lys Ala Val Ala Gly Phe 
165 170 175 

Leu Leu Glu Asn Glu He Ala Tyr He Gin Glu Ala Val Glu Thr Pro 
180 185 190 

Glu Arg Pro Phe Val Ala He Leu Gly Gly Ser Lys Val Ser Asp Lys 
195 200 205 

lie Gly Val He Glu Asn Leu Leu Glu Lys Ala Asp Lys Val Leu He 
210 215 220 

Gly Gly Gly Met Thr Tyr Thr Phe Tyr Lys Ala Gin Gly He Glu He 
225 230 235 240 

Gly Asn Ser Leu Val Glu Glu Asp Lys Leu Asp Val Ala Lys Ala Leu 
245 ,250 255 



Leu Glu Lys Ala Asn Gly Lys Leu He Leu Pro Val Asp Ser Lys Glu 
260 265 270 



Ala Asn Ala Phe Ala Gly Tyr Thr 
275 280 

Ala Val Ser Glu Gly Phe Leu Gly 
290 295 

Ala Lys Phe Asp Glu Ala Leu Thr 
305 310 

Gly Pro Met Gly Val Phe Glu Asn 
325 

Gly Val Met Asp Ala He Val Lys 
340 

Gly Gly Gly Asp Ser Ala Ala Ala 

355 360 

Lys Phe Ser Trp He Ser Thr Gly 
370 375 

Glu Gly Lys Val Leu Pro Gin Leu 
385 .390 



Glu Val Arg Asp Thr Glu Gly Glu 
285 

Leu Asp He Gly Pro Lys Ser He 
300 

Gly Ala Lys Thr Val Val Trp Asn 
315 320 

Pro Asp Phe Gin Ala Gly Thr He 
330 335 

Gin Pro Gly Val Lys Ser He He 
345 350 

Ala He Asn Leu Gly Arg Ala Asp 
365 

Gly Gly Ala Ser Met Glu Leu Leu 
380 

Ala Ala Leu Thr Glu Lys 
395 



<210> 179 
<211> 165 
<212> PRT 
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<213> Streptococcus pneumoniae 
<400> 179 

Met Leu Lys Ser Glu Lys Gin Ser Arg Tyr Gin Met Leu Asn Glu Glu 
15 10 15 

Leu Ser Phe Leu Leu Glu Gly Glu Thr Asn Val Leu Ala Asn Leu Ser 
20 25 30 

Asn Ala Ser Ala Leu lie Lys Ser Arg Phe Pro Asn Thr Val Phe Ala 
35 40 45 

Gly Phe Tyr Leu Phe Asp Gly Lys Glu Leu Val Leu Gly Pro Phe Gin 
50 55 60 

Gly Gly Val Ser Cys lie Arg lie Ala Leu Gly Lys Gly Val Cys Gly 
65 70 75 80 

Glu Ala Ala His Phe Gin Glu Thr Val lie Val Gly Asp Val Thr Thr 
85 90 95 

Tyr Leu Asn Tyr lie Ser Cys Asp Ser Leu Ala Lys Ser Glu lie Val 
* 100 105 110 

Val Pro Met Met Lys Asn Gly Gin Leu Leu Gly Val Leu Asp Leu Asp 
115 120 125 

Ser Ser Glu lie Glu Asp Tyr Asp Ala Met Asp Arg Asp Tyr Leu Glu 
130 135 140 

Gin Phe Val Ala lie Leu Leu Glu Lys Thr Ala Trp Asp Phe Thr Met 
145 150 155 160 

Phe Glu Glu Lys Ser ' 
165 



<210> 180 

<211> 209 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 180 

Met Thr lie Glu Leu Leu Thr Pro Phe Thr Lys Val Glu Leu Glu . Pro 
1 5 10 15 

Glu lie Lys Glu Lys Lys Arg Lys Gin .Val Gly lie Leu Gly Gly Asn 
20 25 30 

Phe Asn Pro Val His Asn Ala His Leu lie Val Ala Asp Gin Val Arg 
35- 40 45 

Gin Gin Leu Gly Leu Asp Gin Val Leu Leu Met Pro Glu Tyr Gin Pro 
50 55 60 

Pro His Val Asp Lys Lys Glu Thr lie Pro Glu His His Arg Leu Lys 
65 70 75 80 
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Met Leu Glu Leu Ala lie Glu Gly He Asp Gly Leu Val He Glu Thr 
85 90. 95 

5 He Glu Leu Glu Arg Lys Gly He Ser Tyr Thr Tyr Asp Thr Met Lys 
100 105 110 

He Leu Thr Glu Lys Asn Pro Asp Thr Asp Tyr Tyr Phe He lie Gly 
115 120 125 

10 

Ala Asp Met Val Asp Tyr Leu Pro Lys Trp Tyr Arg lie Asp Glu Leu 
130 135 • 140 

Val Asp Met Val Gin Phe Val Gly Val Gin Arg Pro Arg Tyr Lys Val 
15 145 150 155 160 

Gly Thr Ser Tyr Pro Val lie Trp Val Asp Val Pro Leu Met Asp lie 
165 170 175 

20 Ser Ser Ser Met Val Arg Ala Phe Leu Ala Gin Gly Arg Lys Pro Asn 
180 - 185 190 

Phe Leu Leu Pro Gin Pro Val Leu Asp Tyr lie Glu Lys Glu Gly Leu 
195 200 205 

25 

Tyr 



30 <210> 181 

<211> 255 

<212> PRT 

<213> Streptococcus pneumoniae 

35 <400> 181 

Met Asn He Ala Lys. He Val Arg Glu Ala Arg Glu Gin Ser Arg Leu 
.1 5 10 15 

Thr Thr Leu Asp Phe Ala Thr Gly He Phe Asp Glu Phe lie Gin Leu 
40 20 25 30 

His Gly Asp Arg Ser Phe Arg Asp Asp' Gly Ala Val Val Gly Gly lie 
35 40 45 

45 Gly Trp Leu Gly Asp, Gin Ala Val Thr Val Val Gly lie Gin Lys Gly 
50 55 60 

Lys Ser Leu Gin Asp Asn Leu Lys Arg Asn Phe Gly Gin Pro His Pro 
65 70 75 80 

50 

Glu Gly Tyr Arg Lys Ala Leu Arg Leu Met Lys Gin Ala Glu Lys Phe 
85 90 95 

Gly Arg Pro Val Val Thr Phe lie Asn Thr Ala Gly Ala Tyr Pro Gly 
55 100 105 110 

Val Gly Ala Glu Glu Arg Gly Gin Gly Glu Ala He Ala Arg Asn Leu 
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115 120 125 

Met Glu Met Ser Asp Leu Lys Val Pro lie lie Ala lie lie lie Gly 
130 135 140 

Glu Gly Gly Ser Gly Gly Ala Leu Ala Leu Ala Val Ala Asp Arg Val 
145 150 155 160 

Trp Met Leu Glu Asn Ser lie Tyr Ala lie Leu Ser Pro Glu Gly Phe 
165 170 175 

Ala Ser lie Leu Trp Lys Asp Gly Thr Arg Ala Met Glu Ala Ala Glu 
180 " 185 190 

Leu Met Lys He Thr Ser His Glu Leu Leu Glu Met Asp Val Val Asp 
195 200 205 

Lys Val He Ser Glu Val Gly Leu Ser Ser Lys Glu Leu He Lys Ser 
210 215 220 

Val Lys Lys Glu Leu Gin Thr Glu Leu Ala Arg Leu Ser Gin Lys Pro 
225 230 235 240 

Leu Glu Glu Leu Leu Glu Glu Arg Tyr Gin Arg Phe Arg Lys Tyr 
245 250 255 



<210> 182 
<211> 169 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 182 

Met He He Lys Val Glu Met Ala Asp Val Glu Val Leu Ala Lys He 
1 5 10 15 

Ala Lys Gin Thr Phe Arg Glu Thr Phe Ala Tyr Asp Asn Thr Glu Glu 
20 25 30 

Gin Leu Gin Glu Tyr Phe Glu Glu Ala Tyr Ser Leu Lys Thr Leu Ser 
35 40 45 

Thr Glu Leu Gly Asn Pro Asp Ser Glu Thr Tyr Phe He Met His Glu 
50 55 60 

Glu Glu He Ala Gly Phe Leu Lys Val Asn Trp Gly Ser Ala Gin Thr 

65 70 ...... 75 80 

-Glu Arg Glu Leu Glu Asp Ala Phe Glu He Gin Arg Leu Tyr Val Leu* 
85 90 95- 

Gin Lys Phe Gin Gly Phe Gly Leu Gly Lys Gin Leu Phe Glu Phe Ala 
100 105 110 

Leu Glu Leu Ala Thr Lys Asn Ser Phe Ser Trp Ala Trp Leu Gly Val 
115 120 125 
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Trp Glu His Asn Thr Lys Ala Gin Ala Phe Tyr Asn Arg Tyr Gly Phe 

130 135 140 

Glu Lys Phe Ser Gin His His Phe Met Val Gly Gin Lys Val Asp Thr 

145 . 150 155 160 

Asp Trp Leu Leu Arg Lys Lys Leu Arg 
165 



<210> 183 
<211> 529 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 183 

Met Leu Arg Gly Thr Ala Leu Leu Thr Ala Ser Asn Phe lie Ser Arg 
1 5 10 15 

Leu Leu Gly Ala Val Tyr lie He Pro Trp Tyr He Trp Met Gly Ala 
20 25 30 

Tyr Ala Ala Lys Ala Asn Gly Leu Phe Thr Met Gly Tyr Thr He Tyr 
35 40 45 

Ala Trp Phe Leu Leu Val Ser Thr Ala Gly He Pro Val Ala Val Ala 
50 55 60 

Lys Gin Val Ala Lys Tyr Asn Thr Met Arg Glu Glu Glu His Ser Phe 
65 70 75 80 

Ala Leu He Arg Ser Phe Leu Gly Phe Met Thr Gly Leu Gly Leu Val 
85 90 95 

Phe Ala Leu Val Leu Tyr Val Phe Ala Pro Trp Leu Ala Asp Leu Ser 
100 . 105 110 

Gly Val Gly Lys Asp Leu He Pro He Met Gin Ser Leu Ala Trp Gly 
115 120 125 

Val Leu lie Phe Pro Ser Met Ser Val He Arg Gly Phe Phe Gin Gly 
130 135 140 

Met Asn Asn Leu Lys Pro Tyr Ala Met Ser Gin He Ala Glu Gin Val 
145 150 155 160 

He Arg Val He Trp Met Leu Leu Ala /Thr Phe He He Met Lys Leu 
165 170 175 

Gly Ser Gly Asp Tyr Leu Ala Ala Val Thr Gin Ser Thr Phe Ala Ala 
180 185 190 

Phe Val Gly Met Val Ala Ser Phe Ala Val Leu He Tyr Phe Leu Ala 
195 . 200 205 

Gin Glu Ser Ser Leu Lys Arg Val Phe Glu Thr Gly Asp Lys He Asn 
210 215 220 
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10 



Ser Lys Arg Leu Leu Val Asp Thr lie Lys Glu Ala lie Pro Phe lie 
225 230 235 240 

Leu Thr Gly Ser Ala lie Gin lie Phe Gin lie Leu Asp Gin Leu Thr 
245 250 255 

Phe lie Asn Ser Met Ser Trp Phe Thr Asn Tyr Ser Asn Glu Asp Leu 
260 265 270 

Val Val Met Phe .Ser Tyr Phe Ser Ala Asn Pro Asn Lys lie Thr Met 
. 275 280 285 



He Leu He Ser Val Gly Val Ser He Gly Ser Val Gly Leu Pro Leu 
15 290 295 300 

Leu Thr Glu Asn Tyr Val Lys Gly Asp Leu Lys Ala Ala Ser Arg Leu 
305 310 315 320 

20 Val Gin Asp Ser Leu Thr Leu Leu Phe Met Phe Leu Leu Pro Ala Thr 

325 330 335 



25 



Val Gly Val Val Met Val Gly Glu Pro Leu Tyr Thr Val Phe Tyr Gly 
340 345 350 

Lys Pro Asp Ser Leu Ala Leu Gly Leu Phe Val Phe Ala Val Leu Gin 
355 360 365 



Ser He He Leu Gly Leu Tyr Met Val Leu Ser Pro Met Leu Gin Ala 
30 370 375 380 

Met Phe Arg Asn Arg Lys Ala Val Leu Tyr Phe He Tyr Gly Ser He 
385 390 395 400 

35 Ala Lys Leu Val Leu Gin Leu Pro Thr He Ala Leu Phe His Ser Tyr 

405 410 415 



40 



Gly Pro Leu He Ser Thr Thr He Ala Leu He He Pro Asn Val Leu 
420 425 430 

Met Tyr Arg Asp lie Cys Lys Val Thr Gly Val Lys Arg Lys Val He 
435 440 445 



Leu Lys Arg Thr He Leu He Ser Leu Leu Thr Leu Val Met Phe Leu 
45 450 455 460 

Leu He Gly Thr He Gin Trp Leu Leu..Gly Phe Phe Phe Gin Pro Ser 

465 470 475 480 

50 Gly Arg Leu Trp Ser Phe Phe Tyr Val Ala Leu Val Gly Ala Met Gly 

485 490 495 



55 



Gly Gly Leu Tyr Met Val Met Ser Leu Arg Thr Tyr Leu Leu Asp Lys 

500 505 510 

Val He Gly Lys Ala Gin Ala Asp Arg Leu Arg Ala Lys Phe Lys Leu 
515 520 525 
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Ser 



<210> 184 

<211> 155 

<212> PRT . 

<213> Streptococcus 



pneumoniae 



<400> 184 

Met Ser Asp Lys lie Gly Leu Phe Thr Gly Ser Phe Asp Pro Met Thr 
1 5 10 15 

Asn- Gly His Leu Asp lie lie Glu Arg Ala Ser Arg Leu Phe Asp Lys 
20 25 30 



Leu Tyr Val Gly lie Phe Phe Asn 
35 40 

lie Glu Asn Arg Lys Arg Gly Leu 
50 55 

Asn Val Glu Val Val Ala Ser His 
65 70 

Lys Arg Leu Gly Ala Thr Cys Leu 
85 

Asp Leu Gin Tyr Glu Ala Ser Phe 
100 

Ser Asp lie Glu Thr He Tyr Leu 
115 120 



Pro His Lys Gin Gly Phe Leu Pro 
45 

Glu Lys Ala Leu Gly His Leu Glu 
60 

Asp Glu Leu Val Val Asp Val Ala 
75 80 

Val Arg Gly Leu Arg Asn Ala Ser 
90 95 

Asp Tyr Tyr Asn His Gin Leu Ser 
105 110 

His Ser Arg Pro Glu His Leu Tyr • 
125 



He Ser Ser Ser Gly Val Arg Glu Leu Leu Lys Phe Gly Gin Asp He 
130 135 140 

Ala Cys Tyr Val Pro Glu Ser He Trp Arg Lys 
145 150 155 



<210> 185 

<211> 143 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 185 •- 

Met Thr He Leu Phe Val Val He Ser Ala Ser Phe Leu Tyr Met Val 
1 5 10 15 

Ser Leu Ser Met Lys Pro Tyr Gin Thr Ala Lys Ser Glu. Gly Glu Lys 
20 25 30 

Leu Ala Gin Gin Tyr Ala Gly Leu Glu Gin Ala Asp Gin Val Asp Leu 
35 40 45 
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Tyr Asn Gly Leu Glu Ser Tyr Tyr 
50 55 

Gin Glu Ala Leu Ala Val Leu lie 
5 65 '70 

Val Tyr Gin Leu Asn Gin Gly Val 
85 

10 Ser Lys Glu Lys Gly Ala Gly Glu 
100 



Ser Val Leu Gly Arg Asn Lys Gin 
60 

Gly Lys Asp Asp His Lys lie Tyr 

75 80 

Ser Gin Glu Lys Ala Glu Thr Val 
90 95 

lie Asp Lys lie He Phe Gly Arg 
105 110 



15 



Tyr Gin Asp Lys Pro He Trp Glu Val Lys Ser Gly Ser Asp Phe Tyr 
115 120 125 

Leu Val Asp Phe Glu Thr Gly Ala Leu Val Asn Lys Glu Gly Leu 
130 135 140 



20 <210> 186 
<211> 243 
<212> PRT 

<213> Streptococcus pneumoniae 
25 <400> 186 

Met He Asp He His" Ser His He Val Phe Asp Val Asp Asp Gly Pro 
15 10 15 

Lys Ser Arg Glu Glu Ser Lys Ala Leu Leu Thr Glu Ala Tyr Arg Gin 
30 20 25 30 

Gly Val Arg Thr He Val Ser Thr Ser His Arg Arg Lys Gly Met Phe 
35 40 45 

35 Glu Thr Pro Glu Glu Lys He Ala Glu Asn Phe Leu Gin Val Arg Glu 
50 55 60 



40 



He Ala Lys Glu Val Ala Ser Asp. Leu Val He Ala Tyr Gly Ala Glu 

65 70 75 80 

He Tyr Tyr Thr Pro Asp Val Leu Asp Lys Leu Glu Asn Asn Arg He 

85 90 95 



Pro Thr Leu Asn Asn Ser Arg Tyr Ala Leu He Glu Phe Ser Met Asn 
45 100 105 HO 

Thr Pro Tyr Arg Asp He His Ser Ala. Leu Asn Lys He Leu Met Leu 
115 120 125 

50 Gly He Thr Pro Val He Ala His He Glu Arg Tyr Asp Val Leu Glu 
130 135 140 

Asn Asn Glu Lys" Arg Val Arg Glu Leu He Asp Met Gly Cys Tyr Thr 
145 150 155 160 

.55 

Gin He Asn Ser Ser His Val Leu Lys Ser Lys Leu Phe Gly Glu Pro 
165 170 175 
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Tyr Lys Phe Met Lys Lys Arg Ala 
180 

Val His lie lie Ala Ser Asp Met 
195 200 

His Met Ala Glu Ala Tyr Asp Leu 
210 215 

Lys Ala Gin Glu Leu Phe lie Asp 
225 230 

Gin Leu lie 



Gin Tyr Phe Leu Glu Arg Asp Leu 
185 190 

His Asn Val Asp Gly Arg Pro Pro 
205 

Val Ser Gin Lys Tyr Gly Glu Ala 
220 

Asn Pro Arg Lys lie Val Met Asp 
.235 240 



<210> 187 
<211> 308 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 187 

Met Ser Thr lie Asp Lys Glu Lys Phe Gin Phe Val Lys Arg Asp Asp 
15 10 15 

Phe Ala Ser Glu Thr lie Asp Ala Pro Ala Tyr Ser Tyr Trp Lys Ser 
20 25 30 



Val Phe Lys Gin Phe Met Lys Lys 
35 40 

He Leu Val Ala He He Leu lie 
50 55 

Lys Phe Asp Phe Asn Asp Val Ser 
65 70 

Tyr He Lys Pro Asn Ala Glu His 
85 

Lys Ser Leu Phe Asp Gly Val Trp 
100 

He Ser Val He Ala Thr Val He 
115 120 

Gly Gly He Trp Gly He Ser Lys 
130 135 



Lys Ser Thr Val Val Met Leu Gly 
45 

Ser Phe He Tyr Pro Met Phe Ser 
60 

Lys Val Asn Asp Phe Ser Val Arg 
75 80 

Trp Phe Gly Thr Asp Ser Asn Gly 
90 95 

Phe Gly Ala Arg Asn Ser He Leu 
105 110 

Asn Leu Val He Gly Val Phe Val 
' 125 

Ser Val Asp Arg Val Met Met Glu 
140 



Val Tyr Asn Val He Ser Asn He Pro Pro Leu Leu He Val He Val 
145 150 155 160 

Leu Thr Tyr Ser lie Gly Ala Gly Phe Trp Asn Leu He Phe Ala Met 
165 170 175 

Ser Val Thr Thr Trp lie Gly He Ala Phe Met He Arg Val Gin He 
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180 185 190 

Leu Arg Tyr Arg Asp Leu Glu Tyr Asn Leu Ala Ser Arg Thr Leu Gly 
195 200 205 

5 

Thr Pro Thr Leu Lys lie Val Ala Lys Asn lie Met Pro Gin Leu Val 
210 215 220 

Ser Val He Val Thr Thr Met Thr Gin Met Leu Pro Ser Phe He Ser 
10 225 230 235 240 

Tyr Glu Ala Phe Leu Ser Phe Phe Gly Leu Gly Leu Pro He Thr Val 
245 250 255 

15 Pro Ser Leu Gly Arg Leu He Ser Asp Tyr Ser Gin Asn Val Thr Thr 
■ 260 265 270 

Asn Ala Tyr Leu Phe Trp He Pro Leu Thr Thr Leu Val Leu Val Ser 
275 280 285 

20 

Leu Ser Leu Phe Val Val Gly Gin Asn Leu Ala Asp Ala Ser Asp Pro 
290 295 300 

Arg Thr His Arg 
25 305 



<210> 188 

<211> 77 

30 <212> PRT 

- <213> Streptococcus pneumoniae 

<400> 188 

Met Tyr Asn Leu Leu Leu Thr He Leu Leu Val Leu Ser Val Val He 
35 1 . 5 .10 15 

Val He Ala He Phe Met Gin Pro Thr Lys Asn Gin Ser Ser Asn Val 
20 25 30 

40 Phe Asp Ala Ser Ser Gly Asp Leu Phe Glu Arg Ser Lys Ala Arg Gly 
.35 40 45 

Phe Glu Ala Val Met Gin Arg Leu Thr Gly He Leu Val Phe Phe Trp 
50 55 60 

45 

Leu Ala He Ala Leu Ala Leu Thr Val Leu Ser Ser Arg 

65 70 .... 75 



50 <210> 189 
<211> 369 
<212> PRT 

<213> Streptococcus pneumoniae 
55 <400> 189 

Met Phe Arg Arg Asn Lys Leu Phe Phe Trp Thr Thr Glu He Leu Leu 
15 10 15 
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Leu Thr He He Phe Tyr Leu Trp Arg Gin Met Gly Ser Leu He Asn 
20 25 30 

Pro Phe Val Ser Val Leu Asn Thr He Met He Pro Phe Leu Leu Gly 
35 40 45 



10 



Gly Phe Leu Tyr Tyr Leu Thr Asn Pro He Val Thr Phe Leu Asn Lys 
50 55 ' 60 

Val Cys Lys Leu Asn Arg" Leu Leu Gly He Leu He Thr Leu Cys Thr 
65 70 75 80 



Leu Val Trp Gly Met Val He Gly Val Val Tyr Leu Leu Pro He Leu 
15 ' 85 90 95 

He Asn Gin Leu Ser Ser Leu He He Ser Ser Gin Thr He Tyr Ser 
100 105 110 

20 Arg Val Gin Asp Leu He He Asp Leu Ser Asn Tyr Pro Ala Leu Gin 
115 120 125 



25 



Asn Leu Asp Val Glu Ala Thr He Gin Gin Leu Asn Leu Ser Tyr Val 
130 135 140 

Asp lie Leu Gin Asn lie Leu Asn Ser Val Ser Asn Ser Val Gly Ser 
145 150 155 160 



Val Leu Ser Ala Leu He Ser Thr Val Leu lie Leu lie Met Thr Pro 
30 165 170 175 

Val Phe Leu Val Tyr Phe Leu Leu Asp Gly His Lys Phe Leu Pro .Met 
180 185 190 

35 Leu Glu Arg Thr He Leu Lys Arg Asp Arg Leu His lie Ala Gly Leu" 
195 200 205 



40 



Leu Lys Asn Leu Asn Ala Thr lie Ala Arg Tyr lie Ser Gly Val Ser 
210 215 220 

lie Asp Ala He He lie Gly Cys Leu Ala Tyr lie Gly Tyr Ser lie 

225 230 235 240 



lie Gly Leu Lys Tyr Ala Leu Val Phe Ala lie Phe Ser Gly Val Ala 
45 245 250 255 

Asn Leu He Pro Tyr Val Gly Pro Ser.Jle Gly Leu lie Pro Met He 
260 265 270 

50 lie Ala Asn lie Phe Thr Val Pro His Arg Leu Leu lie Ala Val He 

275 280 285 



55 



Tyr Met Leu Val Val Gin Gin Val Asp Gly Asn lie Leu Tyr Pro Arg 
290 295 300 

lie Val Gly Ser Val Met Lys Val His Pro lie Thr He Leu Val Leu 
305 310 315 320. 
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Leu Leu Leu Ser Ser Asn lie Tyr Gly Val Val Gly Met lie Val Ala 
325 330 335 

Val Pro Thr Tyr Ser lie Leu Lys Glu lie Ser Lys Phe Leu Ser Arg 
340 345 350 

Leu Tyr Glu Asn His Lys lie Met Lys Glu Arg Glu Arg Glu Leu Ala 
355 360 365 



<210> 190 

<211> 451 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 190 

Met Tyr Gin Ala Leu Tyr Arg Lys Tyr Arg Ser Gin Asn Phe Ser Gin 

1.5 10 15 

Leu Val Gly Gin Glu Val Val Ala Lys Thr Leu Lys Gin Ala Val Glu 
20 25 *30 

Gin Glu Lys lie Ser His Ala Tyr Leu Phe Ser Gly Pro Arg Gly Thr 
35 40 45 

Gly Lys Thr Ser Val Ala Lys lie Phe Ala Lys Ala Met Asn Cys Pro 
50 55 60 

Asn Gin Val Gly Gly Glu Pro Cys Asn Asn Cys Tyr lie Cys Gin Ala 
65 70 75 80 

Val Thr Asp Gly Ser Leu Glu Asp Val lie Glu Met Asp Ala Ala Ser 
85 90 95 

Asn Asn Gly Val Asp Glu lie Arg Glu He Arg Asp Lys Ser Thr Tyr 
100 105 110 

Ala Pro Ser Leu Ala Arg Tyr Lys Val Tyr He He Asp Glu Val His 
115 120 125 

Met Leu Ser Thr Gly Ala -Phe Asn Ala Leu Leu Lys Thr Leu Glu Glu 
130 135 140 

Pro Thr Gin Asn Val Val Phe He Leu Ala Thr Thr Glu Leu His Lys 
145 150 155 160 

He Pro Ala Thr He Leu Ser Arg Val Gin Arg Phe Glu Phe Lys Ser 
165 170 175 

He Lys Thr Gin Asp He Lys Glu His He His Tyr He Leu Glu Lys 
180 185 190 

Glu Asn He Ser Ser Glu Pro Glu Ala Val Glu lie lie Ala Arg Arg 
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195 



200 



205 



Ala Glu Gly Gly Met Arg Asp Ala Leu Ser He Leu Asp Gin Ala Leu 
210 215 220 

Ser Leu Thr Gin Gly Asn Glu Leu Thr Thr Ala He Ser Glu Glu He 
225 230 235 240 

Thr Gly Thr He Ser Leu Ser Ala Leu Asp Asp Tyr Val Ala Ala Leu 
245 250 255 

Ser Gin Gin Asp Val Pro Lys Ala Leu Ser Cys Leu Asn Leu Leu Phe 
260 265. 270 

Asp Asn Gly Lys Ser Met Thr Arg Phe Val Thr Asp Leu Leu His Tyr 
275 280 285 

Leu Arg Asp Leu Leu He Val Gin Thr Gly Gly Glu Asn Thr His His 
290 295 300 

Ser Ser Val Phe Val Glu Asn Leu Ala Leu Pro Gin Lys Asn Leu Phe 
305 310 315 320 

Glu Met He Arg Leu Ala Thr Val Asn Leu Ala Asp He Lys Ser Ser 
325 330 335 

Leu Gin Pro Lys He Tyr Ala Glu Met Met Thr Val Arg Leu Ala Glu 
340 345 350 

He Lys Pro Glu Pro Ala Leu Ser Gly Ala Val Glu Asn Glu He Ala 
355 360 365 

Thr Leu Arg Gin Glu Val Ala Arg Leu Lys Gin Glu Leu Ser Asn Ala 
370 375 380 

Gly Ala Val Pro Lys Gin Val Ala Pro Ala- Pro Ser Arg Pro Ala Thr 
385 390 395 400 

Gly Lys Thr Val Tyr Arg Val Asp Arg Asn Lys Val Gin Ser He Leu 
405 410 415 

Gin Glu Ala Val Glu Asn Pro Asp Leu Thr Arg Gin Asn Leu He Arg 
420 425 430 

Leu Gin Asn Ala Trp Gly Glu Val He Glu Ser Leu Gly Gly Pro Asp 
435 440 445 



Lys Leu Cys 
450 



<210> 191 
<211> 662 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 191 
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Met Phe Arg Leu Thr Asn Lys Leu Ala Val Ser Asn Leu lie Lys Asn 
15 10 15 

Arg Lys Leu Tyr Tyr Pro Phe Ala Leu Ala Val Leu Leu Ala Val Thr 
20 25 30 

Leu Thr Tyr Leu Phe Tyr Ser Leu Thr Phe Asn Pro Lys lie Ala Glu 
35 40 45 

lie Arg Gly Gly Thr ' Thr lie Gin Ala Thr Leu Gly Phe Gly Met Phe 
50 55 60 

Val Val Thr Leu Ala Ser Ala lie He Val Leu Tyr Ala Asn Ser Phe 
65 70 75 80 

Val Met Lys Lys Arg Ser Lys Glu Leu Gly He Tyr Gly Met Leu Gly 
85 90 95 

Leu Glu Lys Arg His Leu He Ser Met Thr Phe Lys Glu Leu Val Val 
100 105 110 

Phe Gly He Leu Thr Val Gly Ala Gly He Gly He Gly Ala Leu Phe 
115 120 125 

Asp Lys Leu lie Phe Ala Phe Leu Leu Lys Leu Met Lys Leu Lys Val 
130 135 140 

Glu Leu Val Ala Thr Phe Gin Thr Lys Val Val He Thr Val Leu Val 
145 150 155 160 

Val Phe Gly Leu lie Phe Leu Gly Leu Met Phe Leu Asn Ala Leu Arg 
165 170 175 

lie Ala Arg Met Asn Ala Leu Gin Leu Ser Arg Glu Lys Ala Ser Gly 
180 185 190 

Glu Lys Lys Gly Arg Phe Leu Pro Leu Gin Thr He Leu Gly Ser He 
195 200 205 

Ser Leu Gly He Gly Tyr Tyr Leu Ala Leu Thr Val Lys Asp Pro Leu 
210 215 220 

Thr Ala Leu Thr Thr Phe Phe lie Ala Val Leu Leu Val He Phe Gly 
225 230 235 240 

Thr Tyr Leu Leu Phe Asn Ala Gly lie Thr Val Phe Leu Gin lie Leu 
245 250 255 

Lys Lys Asn Lys Lys Tyr Tyr Tyr Gin Pro Asn Asn Leu lie Ser Val 
260 265 270 

Ser Asn Leu lie Phe Arg Met Lys Lys Asn Ala Val Gly Leu Ala Thr 
275 280 285 



lie Ala lie Leu Ser Thr Met Val Leu Val Thr Met Ser Ala Ala Thr 
290 295 300 
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Ser lie Phe Asn Ser Ala Glu Ser Phe Lys Lys Val Leu Asn Pro His 
305 310 m 315 320 

Asp Phe Gly Val Ser Gly Gin Asn Val Glu Lys Glu Asp Leu Asp Lys 
325 330 335 

Leu Leu Ser Gin Phe Ala Ser Asp Asn Gly Tyr Lys He Lys Glu Lys 
340 345 350 

Glu Val Phe Arg Tyr Thr Tyr Phe Gly Val Ala Asn Gin Glu Gly Asn 
355 360 365 

Lys Leu Thr Phe Phe Glu Lys Gly Gin Asn Arg Val Gin Pro Thr Thr 
370 375 380 

Val Phe Met Val Phe Asp Gin Lys Asp Tyr Glu Asn Met Thr Gly Gin 
385 390 395 400 

Lys Leu Ser Leu Ser Gly Asn Glu Val Gly Leu Phe Ala Lys Asn Asp 
405 410 415 

Gly Leu Lys Gly Gin Lys Thr Leu He Leu Asn Asp His Gin Phe Ser 
420 425 430 

Val Lys Glu Glu Phe Asn Lys Asp Phe Xle Val Asn His Val Pro Asn 
435 440 445 

Gin Phe Asn lie Leu Thr Ala Asp Tyr Asn Tyr Leu Val Val Pro Asp 
450 455 460 

Leu Gin Ala Phe Leu Asn Gin Phe Pro Asp Ser Asp lie Tyr Asn Gin 
465 470 475 480 

Phe Tyr Gly Gly Met Asn Val Asn Val Ser Glu Glu Glu Gin Leu Lys 
485 490 495 

Val Ala Glu Glu Tyr Glu Asn Tyr Leu Asn Gin Phe Asn Ala Gin Leu 
500 505 510 

Asp Thr Glu Gly Ser Tyr Val Tyr Gly Ser Asn Leu Ala Asp Ala Ser 
515 520 525 

Ser Gin Met Ser Ala Leu Phe Gly Gly Val Phe Phe He Gly He Phe 
530 535 540 

Leu Ser lie lie Phe Met Val Gly Thr Val Leu Val lie Tyr Tyr Lys . 
545 550 555 560 • 

Gin lie Ser Glu Gly Tyr Glu Asp Arg Glu Arg Phe lie lie Leu Gin 
565 570 575 

Lys Val Gly Leu Asp Gin Lys Gin He Lys Gin Thr lie His Lys Gin 
580 585 590 



Val Leu Thr Val Phe Phe Leu Pro Leu Leu Phe Ala Phe lie His Leu 
* ■ 595 600 605 
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Ala Phe Ala Tyr His Met Leu Ser Leu He Leu Lys Val He Gly Val 
610 615 620 

Leu Asp Thr Thr Met Met Leu He Val Thr Leu Ser He Cys Ala He 
625 630 635 640 

Phe Leu He Ala Tyr Val Leu He Phe Met He Thr Ser Arg Ser Tyr 
645 650 655 

Arg Lys He Val Gin Met 
660 



<210> 192 

<211> 296 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 192 

Met Lys Gin Asp Gin Leu Lys Ala Trp Gin Pro Ala Gin Phe Asp Arg 
1 5 10 15 

Phe Val Arg He Leu Glu Gin Asp Gin Leu Asn His Ala Tyr Leu Phe 
20 • 25 30 

Ser Gly Phe Phe Gly Ser Leu Glu Met Ala Gin Phe Leu Ala Lys Ser 
35 40 45 

Leu Phe Cys Thr Asp Lys Val Gly Val Leu Pro Cys Glu Lys Cys Arg 
50 55 60 

Ser Cys Lys Leu He Glu Gin Glu Glu Phe Pro Asp Val Thr Leu He 
65 70 75 80 

Lys Pro Val Asn Gin Val He Lys Thr Glu Arg He Arg Glu Leu Val 
85 90 95 

Gly Gin Phe Ser Gin Ala Gly lie Glu Ser Gin Gin Gin Val Phe He 
100 105 110 

lie Glu Gin Ala Asp Lys Met His Pro Asn Ala Ala Asn Ser Leu Leu 
115 120 125 

Lys Val He Glu Glu Pro Gin Ser Glu Val Tyr He Phe Phe Leu Thr 
130 135 140 

Ser Asp Glu Glu Lys Met Leu Pro Thr He Arg Ser Arg Thr Gin He 
145 150 155 160 

Phe His Phe Lys Lys Gin Glu Glu Lys Leu He Leu Leu Leu Glu Glh 
' 165 170' 175 

Met Gly Leu Val Lys Lys Lys Ala Thr Leu Leu Ala Lys Phe Ser Gin 
180 185 190 

Ser Arg Ala Glu Ala Glu Lys' Leu Ala Asn Gin Ala Ser Phe Trp Thr 
195 ' 200 205 
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Leu Val Asp Glu Ser Glu Arg Leu Leu Thr Trp Leu Val Ala Lys Lys 
210 215 220 

Lys Glu Ser Tyr Leu Gin Val Ala Lys Leu Ala Asn Leu Ala Asp Asp 
225 230 235 240 

Lys Glu Lys Gin Asp Gin Val Leu Arg lie Leu Glu Val Leu Cys Gly 
245 250 255 

Gin Asp Leu Leu Gin Val Arg Val Arg Val lie Leu Gin Asp Leu Leu 
260 . 265 270 

Glu Ala Arg Lys Met Trp Gin Ala Asn Val Ser Phe Gin Asn Ala Met 
275 280 285 

Glu Tyr Leu Val Leu Lys Glu lie 
290 295 



<210> 193 
<211> 204 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 193 

Met Asn Ser Phe Lys Asn Phe Leu Lys Glu Trp Gly Leu Phe Leu Leu 
1 5 10 15 

He Leu Ser Leu Leu Ala Leu Ser Arg He Phe Phe Trp Ser Asn Val 
20 25 30 

Arg Val Glu Gly His Ser Met Asp Pro Thr Leu Ala Asp Gly Glu He 
35 40 * 45 

Leu Phe Val Val Lys His Leu Pro He Asp Arg Phe Asp lie Val Val 
50 55 60 

Ala His Glu Glu Asp Gly Asn Lys Asp He Val Lys Arg Val He Gly 
65 70 75' 80 

Met Pro Gly Asp Thr He Arg Tyr Glu Asn Asp Lys Leu Tyr He Asn 
85 90 95 

Asp Lys Glu Thr Asp Glu Pro Tyr Leu Ala Asp Tyr He Lys Arg Phe 
100 105 110 

Lys Asp Asp Lys Leu Gin Ser Thr Tyr Ser Gly Lys Gly Phe Glu Gly 
115 120 125 

Asn Lys Gly Thr Phe Phe Arg Ser He Ala Gin Lys Ala Gin Ala Phe 
130 135 140 

Thr Val Asp Val Asn Tyr Asn Thr Asn Phe Ser Phe Thr Val Pro Glu 
145 150 155 160 

Gly Glu Tyr Leu Leu Leu Gly Asp Asp Arg Leu Val Ser Ser Asp Ser 
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165 



170 



175 



Arg His Val Gly Thr Phe Lys Ala Lys Asp lie Thr Gly Glu Ala Lys 
180 .185 190 



Phe Arg Phe Trp Pro lie Thr Arg lie Gly Thr Phe 
•195 200 



10 <210> 194 

<211> 328 

<212> PRT 

<213> Streptococcus pneumoniae 

15 <400> 194 

Met Val Val Phe Thr Gly Ser Thr Val Glu Glu Ala He Gin Lys Gly 
1 5 10 15 

Leu Lys Glu Leu Asp He Pro Arg Met Lys Ala His He Lys Val He 
20 20 .25. 30 

Ser Arg Glu Lys Lys Gly Phe Leu Gly Leu Phe Gly Lys Lys Pro Ala 
35 40 45 

25 Gin Val Asp He Glu Ala He Ser Glu Thr Thr Val Val Lys Ala Asn 
50 55 60 

Gin Gin Val Val Lys Gly Val Pro Lys Lys He Asn Asp Leu Asn Glu 
65 70 75 80 

30 

Pro Val Lys Thr Val Ser Glu Glu Thr Val Asp Leu Gly His Val Val 
85 90 95 

Asn Ala He Lys Lys He Glu Glu Glu Gly Gin Gly He Ser Asp Glu 
35 100 105 110 

Val Lys Ala Glu He Leu Lys His Glu Arg His Ala Ser Thr He Leu 
115 120 125 

40 Glu Glu- Thr Gly His He Glu He Leu Asn Glu Leu Gin He Glu Glu 
130 135 140 

Ala Met Arg Glu Glu Ala Gly Ala Asp Asp Leu Glu Thr Glu Gin Asp 
145 150 155 160 

45 

Gin Thr Glu Asn Gin Asp Leu Lys Glu Met Gly Leu Lys Val Glu Gin 
165 ,170 175 

Ser Tyr Asp lie Ala Gin Val Ala Thr Asp Val Thr Ala Tyr Val Gin 
50 180 185 190 

Ala He Val Asp Asp Met Asp Val Glu Ala Thr Leu Ser Asn Asp Tyr 
195 200 205 

55 Asn Arg Arg Ser He Asn Leu Gin He Asp Thr Asn Glu Pro Gly Arg 
210 215 220 
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lie lie Gly Tyr His Gly Lys Val Leu Lys Ala Leu Gin Leu Leu Ala 
225 230 235 240 

Gin Asn Tyr Leu Tyr Asn Arg Tyr Ser Lys Thr Phe Tyr Val Thr lie 
5 245 250 255- 

Asn Val Asn Asp Tyr Val Glu His Arg Ala Glu Val Leu Gin Thr Tyr 
260 265 270 

10 Ala Gin Lys Leu Ala Asn Arg Val Leu Glu Glu Gly Arg Ser His Lys 
275 280 285 

Thr Asp Pro Met Ser Asn Ser Glu Arg Lys He He His Arg He He 
290 295 300 

15 

Ser Arg Met Asp Gly Val Thr Ser Tyr Ser Glu Gly Asp Glu Pro Asn 
305 310 315 320 

Arg Tyr Val Val Val Asp Thr Glu 
20 325 



• <210> 195 

<211> 460 

25 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 195 

Met Ser Asn Phe Ala He He Leu Ala Ala Gly Lys Gly Thr Arg Met 
30 1 5 10 15 

Lys Ser Asp Leu Pro Lys Val Leu His Lys Val Ala Gly He Ser Met 
20 25 .30 

35 Leu Glu His Val Phe Arg Ser Val Gly Ala lie Gin Pro Glu Lys Thr 
35 40 45 

Val Thr Val Val Gly His Lys Ala Glu Leu Val Glu Glu Val Leu Ala 
50 55 60 

40 

Gly Gin Thr Glu Phe Val Thr Gin Ser. Glu Gin Leu Gly Thr Gly His 
65 .70 75 80 

Ala Val Met Met Thr Glu Pro He Leu Glu Gly Val Ser Gly His Thr 
45 85 90 95 

Leu Val He Ala Gly Asp Thr Pro Leu -He Thr Gly Glu Ser Leu Lys 
100 105 110 

50 Asn Leu He Asp Phe His He Asn His Lys Asn Val Ala Thr He Leu 
115 120 125 

Thr Ala Glu Thr Asp Asn Pro Phe Gly Tyr Gly Arg He Val Arg Asn 
130 135 140 

55 

Asp Asn Ala Glu Val Leu Arg Ser Leu Leu Ser Arg Arg Met Leu Gin 
145 150 155 i60 
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lie Leu Lys Ser Lys Ser Arg Lys Ser Thr Leu Val Thr Tyr Val Phe 
165 170 175 

Asp Asn Glu Arg Leu Phe Glu Ala Leu Lys Asn lie Asn " Thr Asn Asn 
180 185 190 

Ala Gin Gly Glu Tyr Tyr He Thr Asp Val He Gly He Phe Arg Glu. 
195 200 205 

Thr Gly Glu Lys Val Gly Ala Tyr Thr Leu Lys Asp Phe Asp Glu Ser 
210 215 220 

Leu Gly Val Asn Asp Arg Val Ala Leu Ala Thr Ala Glu Ser Val Met 
225 230 235 240 

Arg Arg Arg He Asn His Lys His Met Val Asn Gly Val Ser Phe Val 
245 250 255 

Asn Pro Glu Ala Thr Tyr He Asp He Asp Val Glu He Ala Pro Glu 
260 265 270 

Val Gin He Glu Ala Asn Val He Leu Lys Gly Gin Thr Lys lie Gly 
275 280 285 

Ala Glu Thr Val Leu Thr Asn Gly Thr Tyr Val Val Asp Ser Thr He 
290 295 300 

Gly Ala Gly Ala Val lie Thr Asn Ser Met He Glu Glu Ser Ser Val 
305 310 315 320 

Ala Asp Gly Val Thr Val Gly Pro Tyr Ala His He Arg Pro Asn Ser 
325 330 335 

Ser Leu Gly Ala Gin Val His lie Gly Asn Phe Val Glu Val Lys Gly 
340 345 350 

Ser Ser He Gly Glu Asn Thr Lys Ala Gly His Leu Thr Tyr He Gly 
355 360 365 

Asn Cys Glu Val Gly Ser Asn Val Asn Phe Gly Ala Gly Thr lie Thr 
370 375 380 

Val Asn Tyr Asp Gly Lys Asn Lys Tyr Lys Thr Val He Gly Val Asn 
385 390 395 400 

Val Phe Val Gly Ser Asn Ser Thr lie -lie Ala Pro Val Glu Leu Gly 
405 410 415 

Asp Asn Ser Leu Val Gly Ala Gly Ser Thr He Thr Lys Asp Val Pro 
420 * 425 > 430 

Ala Asp Ala lie Ala lie Gly Arg Gly Arg Gin lie Asn Lys Asp Glu 
435 440 445 



Tyr Ala Thr Arg Leu Pro His His Pro Lys Asn Gin 
450 455 460 
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<210> 196 
<211> 311 
<212> PRT 

<21'3> Streptococcus pneumoniae 
<400> 196 

Met Ser Lys He Leu Val Phe Gly His Gin Asn Pro Asp Ser Asp Ala 
15 10 15 

He Gly Ser .Ser Val Ala Phe Ala Tyr Leu Ala Lys Glu Ala Tyr Gly 
20 25 30 

Leu Asp Thr Glu Ala Val Ala Leu Gly Thr Pro Asn Glu Glu Thr Ala 
35 40 45 

Phe Val Leu Asn Tyr Phe Gly Val Glu Ala Pro Arg Val He Thr Ser 
50 55 60 

Ala Lys Ala Glu Gly Ala Glu Gin Val He Leu Thr Asp His Asn Glu 
65 70 75 80 

Phe Gin Gin Ser Val Ser Asp He Ala Glu Val Glu Val Tyr Gly Val 
85 90 95 

Val Asp His His Arg Val Ala Asn Phe Glu Thr Ala Ser Pro Leu Tyr 
100 105 110 

Met Arg Leu Glu Pro Val Gly Ser Ala Ser Ser He Val Tyr Arg Met 
115 120 125 

Phe Lys Glu His Gly Val Ala Val Pro Lys Glu He Ala Gly Leu Met 
130 135 140 



Leu Ser Gly Leu He Ser Asp Thr Leu Leu Leu Lys Ser Pro Thr Thr 
145 150 155 160 

His Pro Thr Asp Lys He He Ala Pro Glu Leu Ala Glu Leu Ala Gly 
165 170 175 

Val Asn Leu Glu Glu Tyr Gly Leu Ala Met Leu Lys Ala Gly Thr Asn 
180 185 190 

Leu Ala Ser Lys Ser Ala Glu Glu Leu He Asp He Asp Ala Lys Thr 
195 200 205 

Phe Glu Leu Asn Gly Asn Asn Val Arg Val Ala Gin Val Asn Thr. Val 
210 215 220 

Asp lie Ala Glu Val Leu Glu Arg Gin Ala Glu He Glu Ala Ala Met 
225 230 235 240 

Gin Ala Ala Asn Glu Ser Asn Gly Tyr Ser Asp Phe Val Leu Met He 
245 250 255 

Thr Asp He Val Asn Ser Asn Ser Glu He Leu Ala Leu Gly Ala Asn 
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260 265 270 

Met Asp Lys Val Glu Ala Ala Phe Asn Phe Lys Leu Glu Asn Asn His 
275 280 285 

5- 

Ala Phe Leu Ala Gly Ala Val Ser Arg Lys Lys Gin Val Val Pro Gin 
290 295 300 

Leu Thr Glu Ser Phe Asn Thr 
10 305 310 



<210>*197 
<211> 225 
15 <212> PRT 

■ <213> Streptococcus pneumoniae 

<400> 197 

Met He Ser Lys Arg Leu Glu Leu Val Ala Ser Phe Val Ser Gin Gly 
20 1 5 10 15 

Ala He Leu Leu. Asp Val Gly Ser Asp His Ala Tyr Leu Pro He Glu 
20 25 . 30 

25 Leu Val Glu Arg Gly Gin He Lys Ser Ala He Ala Gly Glu Val Val 
35 40 45 

Glu Gly Pro Tyr Gin Ser Ala Val Lys Asn Val Glu Ala His Gly Leu 
50 55 60 

30 

Lys Glu Lys lie Gin Val Arg Leu Ala Asn Gly Leu Ala Ala Phe Glu 
65 70 ' 75 80 

Glu Thr Asp Gin Val Ser Val He Thr He Ala Gly Met Gly Gly Arg 
35 85 90 95 

Leu He Ala Arg He Leu Glu Glu Gly Leu Gly Lys Leu Ala Asn Val 
100 105 HO 

40 Glu Arg Leu He Leu Gin Pro Asn Asn Arg Glu Asp Asp Leu Arg He 
115 120 125 

Trp Leu Gin Asp His Gly Phe Gin He Val Ala Glu Ser He Leu Glu 
130 135 140 

45 

Glu Ala Gly Lys Phe Tyr Glu He Leu Val Val Glu Ala Gly Gin Met 
145 150 155 160 

Lys Leu Ser Ala Ser Asp Val Arg Phe Gly Pro Phe Leu Ser Lys Glu 
50 165 170 175 

Val Ser Pro Val Phe Val' Gin Lys Trp Gin Lys Glu Ala Glu Lys Leu 
180 185 190 

55 Glu Phe Ala Leu Gly Gin He Pro Glu Lys Asn Leu Glu Glu Arg Gin 
• 195 200 205 
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Val Leu Val Asp Lys He Gin Ala lie Lys Glu Val Leu His Val Ser 
210 215 220 

Lys 
225 



<210> 198 
<211> 161 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 198 

Met Asn Leu Asn Asp He Lys Asp Leu Met Thr Gin Phe Asp Gin Ser 
1 5 10 .15 

Ser Leu Arg Glu Phe Ser Tyr Lys Asn Gly Thr Asp Glu Leu Gin Phe 
20 25 30 

Ser Lys Asn Glu Ala Arg Pro Val Pro Glu Val Ala Thr Gin Val Ala 
35 40 45 

Pro Ala Pro Val Leu Ala Thr Pro Ser Pro Val Ala Pro Thr Ser Ala 
50 55 60 

Pro Ala Glu Thr Val Ala Glu Glu Val Pro Ala Pro Ala Glu Ala Ser 
65 70 75 " 80 

Val Ala Ser Glu Gly Asn Leu Val Glu Ser Pro Leu Val Gly Val Val 
85 90 95 

Tyr Leu Ala Ala Gly Pro Asp Lys Pro Ala Phe Val Thr Val Gly Asp 
100 105 110 

Ser Val Lys Lys Gly Gin Thr Leu Val He He Glu Ala Met Lys Val 
115 120 125 

Met Asn Glu He Pro Ala Pro Lys Asp Gly Val Val Thr Glu He Leu 
130 135 140 

Val Ser Asn Glu Glu Met Val Glu Phe Gly Lys Gly Leu Val Arg He 
145 150 155" - 160 

Lys 



<210> 199 
<211> 411 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 199 

Met Lys Leu Asn Arg Val Val Val Thr Gly Tyr Gly Val Thr Ser Pro 
1 5 10 15 

He Gly Asn Thr Pro Glu Glu Phe Trp Asn Ser Leu Ala Thr Gly Lys 
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20 25 

He Gly He Gly Gly He Thr Lys Phe 
35 40 

His Asn Ala Ala Glu He Gin Asp Phe 
50 55 

Lys Lys Asp Thr Asn Arg Phe Asp Asn 
65 70 

Ala Ala Gin Glu Ala Val Asn His Ala 
85 

Asn Arg Asp Arg Phe Gly Val He Val 
100 105 

Lys Glu He Glu Asp Gin Val Leu Arg 
115 120 

Arg Val Lys Pro Met Thr Leu Pro Lys 
130 135 



PCT/US00/35604 



30 

Asp His Ser Asp Phe Asp Val 
45 

Pro Phe Asp Lys Tyr Phe Val 
60 

Tyr Ser Leu Tyr Ala Leu Tyr 
75 80 

Asn Leu Asp Val Glu Ala Leu 
90 95 

Ala Ser Gly He Gly Gly He 
110 

Leu His Glu Lys Gly Pro Lys 
125 

Ala Leu Pro Asn Met Ala Ser 
140 



Gly Asn Val Ala Met Arg Phe Gly Ala Asn Gly Val Cys Lys Ser He 

145 150 155 160 

Asn Thr Ala Cys Ser Ser Ser Asn Asp Ala He Gly Asp Ala Phe Arg 
165 170 175 



Ser He Lys Phe Gly Phe Gin Asp 
180 

Ala Ser He Thr Pro Phe Ala He 
195 200 

Leu Ser Thr Thr Glu Asp Pro Thr 
210 215 

Asp Arg Asn Gly Phe Val Met Gly 
225 230 

Glu Ser Leu Glu His Ala Glu Lys 
245 

Val Val Gly Tyr Gly Asn Thr Cys 
260 

His Pro Glu Gly. Gin Gly Ala He 
275 280 

Glu Ala Glu He Ser Pro Glu Gin 
290 295 

Thr Ser Thr Pro Ala Asn Glu Lys 
305 310 

Val Leu Gly Lys Glu Val Pro Val 



Val Met Leu Val Gly Gly Thr Glu 
185 190 

Ala Gly Phe Gin Ala Leu Thr Ala 
205 

Arg Ala Ser He Pro Phe Asp Lys 
220 

Glu Gly Ser Gly Met Leu Val Leu 
235 240 

Arg Gly Ala Thr He Leu Ala Glu 
250 255 

Asp Ala Tyr His Met Thr Ser Pro 
265 270 

Lys- Ala He Lys Leu Ala Leu Glu 
285 

Val Ala Tyr Val Asn Ala His Gly 
300 

Gly Glu Ser Gly Ala He Val Ala 
315 320 

Ser Ser Thr Lys Ser Phe Thr Gly 
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325 

His Leu Leu Gly Ala Ala Gly Ala 
340 

Ala Met Arg His Asn Phe Val Pro 
355 360 

Ser Asp Tyr lie Glu Ala Asn Val 
370 375 

Glu lie Pro Tyr Ala lie Ser Asn 
385 390 

Ala Val Leu Ala Phe Lys Arg Trp 
405 



330 335 

Val Glu Ala He Val Thr He Glu 
345 350 

Met Thr Ala Gly Thr Ser Glu Val 
365 

Val Tyr Gly Gin Gly Leu Glu Lys 
380 • 

Thr Phe Gly Phe Gly Gly His Asn 
395 400 

Glu Asn Arg 
410 



<210> 200 
<211> 359 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 200 

Met Asn He Tyr Asp Gin Leu Gin Val Val Glu Asp Arg Tyr Glu Glu 
15 10 15 

Leu Gly Glu Leu Leu Ser Asp Pro Asp Val Val Ser Asp Thr Lys Arg 
20 25 30 

Phe Met Glu Leu Ser Lys Glu Glu Ala Ser Asn Arg Asp Thr Val He 
. 35 40 45 

Ala Tyr Arg Glu Tyr Lys Gin Val Leu Gin Asn He Val Asp Ala Glu 
50 55 " 60 

Glu Met He Lys Glu Ser Gly Gly Asp Ala Asp Leu Glu Glu Leu Ala 
65 70 75 80 

Lys Gin Glu Leu Lys Asp Ala Lys Ala Glu Lys Glu Glu Tyr Glu Glu 
85 90 95 

Lys Leu Lys He Leu Leu Leu Pro Lys Asp Pro Asn Asp Asp Lys Asn 
100 105 110 

He He Leu Glu He Arg Gly Ala Ala Gly Gly Asp Glu Ala Ala Leu 
115 120 125 

Phe Ala Gly Asp Leu Leu Thr Met Tyr Gin Lys Tyr Ala Glu Ala Gin 
130 135 140 

Gly Trp Arg Phe Glu Val Met Glu Ala Ser Met Asn Gly Val Gly Gly 
145 150 155 160 

Phe Lys Glu Val Val Ala Met Val Ser Gly Gin Ser Val Tyr Ser Lys 
165 170 175 
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Leu Lys Tyr Glu Ser Gly Ala His Arg Val Gin Arg Val Pro Val Thr 
180 185 190 

Glu Ser Gin Gly Arg Val His Thr Ser Thr Ala Thr Val Leu Val Met 
5 195 200 205 

Pro Glu Val Glu Glu Val Glu Tyr Asp lie Asp Pro Lys Asp Leu Arg 
210 215 220 

10 Val Asp lie Tyr His Ala Ser Gly Ala Gly Gly Gin Asn Val Asn Lys 
225 230 235 240 

• Val Ala Thr Ala Val Arg He Val His Leu Pro Thr Asn He Lys Val 
245 250 255 

15 

Glu Met Gin Glu Glu Arg Thr Gin Gin Lys Asn Arg Glu Lys Ala Met 
260 265 270 

Lys He He Arg Ala Arg Val Ala Asp His Phe Ala Gin He Ala Gin 
20 275 280 285 

Asp Glu Gin Asp Ala Glu Arg Lys Ser Thr He Gly Thr Gly Asp Arg 
290 295. 300 

25 Ser Glu Arg He Arg Thr Tyr Asn Phe Pro Gin Asn Arg Val Thr Asp 
. 305 310 315 320 

His Arg He Gly Leu Thr Leu Gin Lys Leu Asp Thr He Leu Ser Gly 
325 330 335 

30 

Lys Leu Asp Glu Val Val Asp Ala Leu Val Leu Tyr Asp Gin Thr Gin 
340 -345 350 

Lys Leu Glu Glu Leu Asn Lys 
35 355 



<210> 201 
<211> 559 
.40 ■ <212>. PRT 

<213> .Streptococcus pneumoniae 

<400> 201 

Met Ala Tyr Thr Leu Lys Pro Glu Glu Val Gly Val Phe Ala lie Gly 
45 1 5 10 15 

Gly Leu Gly Glu lie Gly Lys Asn Thr Tyr Gly He Glu Tyr Gin Asp 
20 25.. 30 

50 Glu lie lie lie Val Asp Ala Gly He Lys Phe Pro Glu Asp Asp Leu 
35 40 45 

Leu Gly He Asp Tyr Val He Pro Asp Tyr Ser Tyr He Val Asp Asn 
50 55 60 

55 

He Asp Arg Val Lys Ala Val Leu lie Thr His Gly His Glu Asp His 
65 70 75 8*0 
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lie Gly Gly lie Pro Phe Leu 
85 

5 Ala Gly Pro Leu Ala Leu Ala 
100 

Gly Leu Leu Arg Asn Ala Lys 
115 

10 

Leu Thr Phe Lys Asn Leu Lys 
130 135 

lie Pro Glu Pro Leu Gly lie 
15 145 150 



Leu Lys Gin Ala Asn Val Pro lie Tyr 
90 95 

Leu lie Arg Gly Lys Leu Glu Glu His 
105 110 

Leu Tyr Glu lie Asn His Asn Thr Glu 
120 125 

Ala Thr Phe Phe Arg Thr Thr His Ser 
140 

Val He His Thr Pro Gin Gly Lys He 
155 • 160 



Val Cys Thr Gly Asp Phe Lys 
165 

20 Ala Asp Leu His Arg Met Ala 
180 

Leu Leu Ser Asp Ser Thr Asn 
195 

25 

Glu Lys Val Val Gly Gin Ser 
210 215 

Gly Arg He He Phe Ala Ser 
30 225 230 



Phe Asp Phe Thr Pro Val Gly Glu Pro 
170 175 

Ala Leu Gly Glu Glu Gly Val Leu Cys 
185 190 

Ala Glu Val Pro Thr Phe Thr Asn Ser 
200 205 

He Met Lys He He Gin Gly He Glu 
220 

Phe Ala Ser Asn He Phe Arg Leu Gin 
235 240 



Gin Ala Thr Glu Ala Ala Val Lys Thr Gly Arg Lys He Ala Val Phe 
245 250 255 

35 Gly Arg Ser Met Glu Lys Ala He Val Asn Gly He Asp Leu Gly Tyr 

260 265 270 



lie Lys Ala Pro Lys Gly Thr 
275 

40 

Tyr Pro Ala Gly Glu Val Leu 
290 295 

Pro Met Ala Ala Leu Ser Arg 
45 305 310 

Gin Leu Gin Pro Gly Asp Thr 
325 

50 Gly Asn Thr Thr Ser Val Asn 
340 

Gly Val Glu Val He His Gly 
355 

55 

His Gly Gly Gin Gin Glu Gin 
370 375 



Phe He Glu Pro Asn Glu He Lys Asp 
280 285 

He Leu Cys Thr Gly Ser Gin Gly Glu 
300 

He Ala Asn Gly Thr His Arg Gin Val 
315 320 

Val He Phe Ser Ser Ser Pro He Pro 
.,330 335 

Lys Leu He Asn He He Ser Glu Ala 
345 350 

Lys Val Asn Asn He His Thr Ser Gly 
360 * 365 

Lys Leu Met Leu Cys Leu He Lys Pro 
380 
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Lys Tyr Phe Met Pro Val His Gly Glu Tyr Arg Met Gin Lys Val His 
385 390 395 400 

Ala Gly Leu Ala Val Asp Thr Gly Val Glu Lys Asp Asn lie Phe lie 
405 410 415 

Met Ser Asn Gly Asp Val Leu Ala Leu Thr Ala Asp Ser Ala Arg lie 
420 425 430 

Ala Gly His Phe Asn Ala Gin Asp lie Tyr Val Asp Gly Asn Arg lie 
435 440 445 

Gly Glu lie Gly Ala Ala Val Leu Lys Asp Arg Arg Asp Leu Ser Glu 
450 455 460 

Asp Gly Val Val Leu Ala Val Ala Thr Val Asp Phe Lys Ser Gin Met 
465 470 475 480 

lie. Leu Ser Gly Pro Asp He Leu Ser Arg Gly Phe Val Tyr Met Arg 
485 490 495. 

Glu Ser Gly Asp Leu He Arg Gin Ser Gin Arg He Leu Phe Asn Ala 
500 505 ' 510 

•He Arg He Ala Leu Lys Asn Lys Asp Ala Ser Val Gin Ser Val Asn 
515 520 525 

Gly Ala He Val Asn Ala He Arg Pro Phe Leu Tyr Glu Asn Thr Glu 
530 535 540 

Arg Glu Pro He He lie Pro Met He Leu Thr Pro Asp Glu Glu 
545 550 555 



<210> 202 
<211> 450 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 202 

Met Ala Glu Val Glu Glu Leu Arg Val Gin Pro Gin Asp He Leu Ala 
15 10 15 

Glu Gin Ser Val Leu Gly Ala He Phe He Asp Glu Ser Lys Leu Val 
20 25 30 

Phe Val Arg Glu Tyr lie Glu Ser Arg. Asp Phe Phe Lys Tyr Ala His 
35 40 45 

Arg Leu He Phe Gin Ala Met Val Asp Leu Ser Asp Arg Gly Asp Ala 
50 "55 60 

He Asp Ala Thr Thr Val Arg Thr He Leu Asp Asn Gin Gly Asp Leu 
65 70 75 80 

Gin Asn He Gly Gly Leu Ser Tyr Leu Val Glu He Val Asn Ser Val 
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85 



90 



95 



Pro Thr Ser Ala Asn Ala Glu Tyr Tyr Ala Lys lie Val Ala Glu Lys 
100 • . 105 110 

Ala Met Leu Arg Arg Leu lie Ala Lys Leu Thr Glu Ser Val Asn Gin 
115 120 125 

Ala Tyr Glu Ala Ser Gin Pro Ala Asp Glu He He Ala Gin Ala Glu 
130 135 140 

Lys Gly Leu He Asp Val Ser Glu Asn Ala Asn Arg Ser Gly Phe Lys 
145 150 155 160* 

Asn He Arg Asp Val Leu Asn Leu Asn Phe Gly Asn Leu Glu Ala Arg 
165 170 175 

Ser Gin Gin Thr Thr Asp He Thr Gly He Ala Thr Gly Tyr Arg Asp 
180 185 190 

Leu Asp His Met Thr Thr Gly Leu His Glu Glu Glu Leu He He Leu 
195 200 205 

Ala Ala Arg Pro Ala Val Gly Lys Thr Ala Phe Ala Leu Asn He Ala 
210 215 220 

Gin Asn He Gly Thr Lys Leu Asp Lys Thr Val Ala He Phe Ser Leu 
225 230 235 240 

Glu Met Gly Ala Glu Ser Leu Val Asp Arg Met Leu Ala Ala Glu Gly 
245 ' 250 '255 

Leu Val Glu Ser His Ser He Arg Thr Gly Gin Leu Thr Asp Glu Glu 
260 - 265 270 

Trp Gin Lys Tyr Thr He Ala Gin Gly Asn Leu Ala Asn Ala Ser lie 
275 280 285 

Tyr He Asp Asp Thr Pro Gly He Arg He Thr Glu He Arg Ser Arg 
290 295 300 

Ser Arg Lys Leu Ala Gin Glu Thr Gly Asn Leu Gly Leu He Val He 
305 310 315 320 

Asp Tyr Leu Gin Leu He Thr Gly Thr Gly Arg Glu Asn Arg Gin Gin 
325 330 335 

Glu Val Ser Glu He Ser Arg Gin Leu- Lys He Leu Ala Lys Glu Leu 
'340 345 350 

Lys Val Pro Val He Ala Leu Ser Gin Leu Ser Arg Gly Val Glu Gin 
355 360 365 

Arg Gin Asp Lys Arg Pro Val Leu Ser Asp He Arg Glu Ser Gly Ser 
370 375 380 



He Glu Gin Asp Ala Asp He Val Ala Phe Leu Tyr Arg Asp Asp Tyr 
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385 390 395 400 

Tyr Glu Arg Gly Gly Glu Glu Glu Glu Gly lie Pro Asn Asn Lys Val 
405 410 415 

Glu Val lie He Glu Lys Asn Arg Ser Gly Ala Arg GlyThr Val Glu 
420 425 430 

Leu He Val Gin Lys Glu Tyr Asn Lys Phe Ser Ser He Ser Lys Arg 
435 440 445 

Glu Ala 
450 



<210> 203 
<211> 699 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 203 

Met Ala Thr Ala Thr Lys Lys Lys Lys Ser Thr Val Lys Lys Asn Leu 
15. 10 15 

Val He Val Glu Ser Pro Ala Lys Ala Lys Thr He Glu Lys Tyr Leu 
20 25 30 

Gly Arg Asn Tyr Lys Val Leu Ala Ser Val Gly His He Arg Asp Leu 
35 40 45 

Lys Lys Ser Ser Met Ser Val Asp He Glu Asn Asn Tyr Glu Pro Gin 
50 * 55 60 

Tyr He Asn lie Arg Gly Lys Gly Pro Leu lie Asn Asp Leu Lys Lys 
65 70 75 80 

Glu Ala Lys Lys Ala Asn Lys Val Phe Leu Ala Ser Asp Pro Asp Arg 
85 90 95 

Glu Gly Glu Ala lie Ser Trp His Leu Ala His lie Leu Asn Leu Asp 
100 105 ' ' 110 

Glu Asn Asp Ala Asn Arg Val Val Phe Asn Glu lie Thr Lys Asp Ala 
115 120 - 125 

Val Lys Asn Ala Phe Lys Glu Pro Arg Lys lie Asp Met Asp Leu Val 
130 135 140 

Asp Ala Gin Gin Ala Arg Arg lie Leu Asp Arg Leu Val Gly Tyr Ser 
145 150 155 160 

lie Ser Pro lie Leu Trp Lys Lys Val Lys Lys Gly Leu Ser Ala Gly 
165 170 175 

Arg Val Gin Ser lie Ala Leu Lys Leu lie lie Asp Arg Glu Asn Glu 
180 185 190 
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He Asn Ala Phe Gin Pro Glu Glu Tyr Trp Thr Val Asp Ala Val Phe 
195 200 ' 205 

Lys Lys Gly Thr Lys Gin Phe His Ala Ser Phe Tyr Gly Val Asp Gly 
210 215 220 

Lys Lys Met Lys Leu Thr Ser Asn Asn Glu Val Lys Glu Val Leu Ser 
225 230 235 240 

Arg Leu Thr Ser Lys Asp Phe Ser Val Asp Gin Val Asp Lys Lys Glu 
245 250 255 

Arg Lys Arg Asn Ala Pro Leu Pro Tyr Thr Thr Ser Ser Met Gin Met 
260 265 270 

Asp Ala Ala Asn Lys He Asn Phe Arg Thr Arg Lys Thr Met Met Val 
275 280 285 

Ala Gin Gin Leu Tyr Glu Gly He Asn He Gly Ser Gly Val Gin Gly 
290 295 300 

Leu He Thr Tyr Met Arg Thr Asp Ser Thr Arg He Ser Pro Val Ala 
305 310 315 320 

Gin Asn Glu Ala Ala Ser Phe He Thr Asp Arg Phe Gly Ser Lys Tyr 
325 330 335 

Ser Lys His Gly Ser Lys Val Lys Asn Ala Ser Gly Ala Gin Asp Ala 
340 345 350 

His Glu Ala He Arg Pro Ser Ser Val Phe Asn Thr Pro Glu Ser He 
355 360 365 

Ala Lys Tyr Leu Asp Lys Asp Gin Leu Lys Leu Tyr Thr Leu He Trp 
370 375 380 

Asn Arg Phe Val Ala Ser Gin Met Thr Ala Ala Val Phe Asp Thr Met 
385 390 395 400 

Ala Val Lys Leu Ser Gin Lys Gly Val Gin Phe Ala Ala Asn Gly Ser 
405 410 415 

Gin Val Lys Phe Asp Gly Tyr Leu Ala He Tyr Asn Asp Ser Asp Lys 
420 425 430 

Asn Lys Met Leu Pro Asp Met Val Val Gly Asp Val Val Lys Gin Val 
435 440 445 

Asn Ser Lys Pro Glu Gin His Phe Thr Gin Pro Pro Ala Arg Tyr Ser 
450 455 460 

Glu Ala Thr Leu He Lys Thr Leu Glu Glu Asn Gly Val Gly Arg Pro 
465 470 475 * 480 



Ser Thr Tyr Ala Pro Thr He Glu Thr He Gin Lys Arg Tyr Tyr Val 
485 490 495 
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Arg Leu Ala Ala Lys Arg Phe Glu Pro Thr Glu Leu Gly Glu lie Val 
500 505 510 

sn Lys Leu He Val Glu Tyr Phe Pro Asp He Val Asn Val Thr Phe 
515 520 525 - - 

Thr Ala Glu Met Glu Gly Lys Leu Asp Asp Val Glu Val Gly Lys Glu 
530 535 540 



Gin Trp Arg Arg Val He Asp Ala Phe Tyr Lys Pro Phe Ser Lys Glu 
545 550 555 560 

Val Ala Lys Ala Glu Glu Glu Met Glu Lys He Gin He Lys Asp Glu 
565 570 575 

Pro Ala Gly Phe Asp Cys Glu Val Cys Gly Ser Pro Met. Val He Lys 
580 585 590 

Leu Gly Arg Phe Gly Lys Phe Tyr Ala Cys Ser Asn Phe Pro Asp Cys 
595 600 605 

Arg His Thr Gin Ala He Val Lys Glu lie Gly Val Glu Cys Pro Ser 
610 .615 620 

Cys His Gin Gly Gin He He Glu Arg Lys Thr Lys Arg Asn Arg Leu 
625 630 635 640 

Phe Tyr Gly Cys Asn Arg Tyr Pro Glu Cys Glu Phe Thr Ser Trp Asp 
645 650 655 

Lys Pro Val Gly Arg Asp Cys. Pro Lys Cys Gly Asn Phe Leu Met Glu 
660 665 670 

Lys Lys Val Arg Gly Gly Gly Lys Gin Val Val Cys Ser Lys Gly Asp 
675 680 . 685 



Tyr Glu Glu Glu Lys Met Ala Leu Cys Gin Leu 
690 695 



<210> 204 

<211> 326 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 204 

Met Phe He Ser He Ser Ala Gly 
1 5 

Val Gly He Pro Ala Phe He Gin 
20 

Gly Gin Gin Met His Glu Asp Val 
35 40 

Thr Pro Thr Met Gly Gly Leu Val 



lie* Val Thr Phe Leu Leu Thr Leu 
10 15 

Phe Tyr Arg Lys Ala Gin .He Thr 
25 30 

Lys Gin His Gin Ala Lys Ala Gly 
45 

Phe Leu He Thr Ser Val Leu Val 
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50 



55 



60 



Ala Phe Phe Phe Ala Leu Phe Ser Ser Gin Phe Ser Asn Asn Val Gly 
65 70 75 80 

Met He Leu Phe He Leu Val Leu Tyr Gly Leu Val Gly Phe Leu Asp 
85 90 95 

Asp Phe Leu Lys Val Phe Arg Lys He Asn Glu Gly Leu Asn Pro Lys 
100 105 110 

Gin Lys Leu Ala Leu Gin Leu Leu Gly Gly Val He Phe Tyr Leu Phe 
115 120 125 

Tyr Glu Arg Gly Gly Asp Met Leu Ser Val Phe Gly Tyr Gin Val His 
130 135 140 

Leu Gly He Phe Tyr He Val Phe Ala Leu Phe Trp Leu Val Gly Phe 
145 150 155 160 

Ser Asn Ala Val Asn Leu Thr Asp Gly Val Asp Gly Leu Ala Ser lie 
165 170 175 

Ser Val Val He Ser Leu Ser Ala Tyr Gly Val He Ala Tyr Val Gin 
180 185 190 

Gly Gin Met Asp lie Leu Leu Val He Leu Ala Met He Gly Gly Leu 
195 200 205 

Leu Ser Phe Phe He Phe Asn His Lys Pro Ala Lys He Phe Met Gly 
210 215 220 

Asp Val Gly Ser Leu Ala Leu Gly Gly Met Leu Ala Ala He Ser Met 
225 230 235 240 

Ala Leu His Gin Glu Trp Thr Leu Leu He He Gly He Val Tyr Val 
245 250 255 

Phe Glu Thr Thr Ser Val Met Met Gin Val Ser Tyr Phe Lys Leu Thr 
260 265 270 

Gly Gly Lys Arg He Phe Arg Met Thr Pro Val His His His Phe Glu- 
275 280 285 

Leu Gly Gly Leu Ser Gly Lys Gly Asn Pro Trp Ser Glu Trp Lys Val 
290 295 300 

Asp Phe Phe Phe Trp Gly Val Gly Leu' teu Ala Ser Leu Leu Thr Leu 
305 310 315 320 



Ala He Leu Tyr Leu Met 
325 



<210> 205 
<211> 693 
<212> PRT 
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<213> Streptococcus pneumoniae 
<400> 205 

Met Ala Arg Glu Phe Ser Leu Glu Lys Thr Arg Asn lie Gly lie Met 
15 10 15 

Ala His Val Asp Ala Gly Lys Thr Thr Thr Thr Glu Arg lie Leu Tyr 
20 25 30 

Tyr Thr Gly Lys He His Lys He Gly Glu Thr His Glu Gly Ala Ser 
35 40 45 . 

Gin Met Asp Trp Met Glu Gin Glu Gin Glu Arg Gly He Thr He Thr 
50 55 . 60 

Ser Ala Ala Thr Thr Ala* Gin Trp Asn Asn His Arg Val Asn He He 
65 70 75 80 

Asp Thr Pro Gly. His Val Asp Phe Thr He Glu Val Gin Arg Ser Leu 
85 90 95 

Arg Val Leu Asp Gly Ala Val Thr Val Leu Asp Ser Gin Ser Gly Val 
100 105 110 

Glu Pro Gin Thr Glu Thr Val Trp Arg Gin Ala Thr Glu Tyr Gly Val 
115 120 125 

Pro Arg lie Val Phe Ala Asn Lys Met Asp Lys He Gly Ala Asp Phe 
130 135 140 



Leu Tyr Ser Val Ser Thr Leu His Asp Arg Leu Gin Ala Asn Ala His 
145 . 150 155 160 

Pro He Gin Leu Pro He Gly Ser Glu Asp Asp Phe Arg Gly He He 
165 170 175 

Asp Leu He Lys Met Lys Ala Glu He Tyr Thr Asn Asp Leu Gly Thr 
180 " 185 190 



Asp He Leu Glu Glu Asp lie Pro 

195 200 

Glu Tyr Arg Glu Lys Leu He Glu 

210 215 

Leu Met Met Lys Tyr Leu Glu Gly 

225 230 

Lys Ala Gly He Arg Lys Ala Thr 
245 

Leu Cys Gly Ser Ala Phe Lys Asn 
260 

Ala Val He Asp Tyr Leu Pro Ser 

275 280 



Ala Glu Tyr Leu Asp Gin Ala Gin 
205 

Ala Val Ala Glu Thr Asp Glu Glu 
220 

Glu Glu He Thr Asn Glu Glu Leu 
235 240 

He Asn Val Glu Phe Phe Pro Val 
250 255 

Lys Gly Val Gin Leu Met Leu Asp 
265 270 

Pro Leu Asp He Pro Ala lie Lys 
285 
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Gly lie Asn Pro Asp Thr Asp Ala Glu Glu lie Arg Pro Ala Ser Asp 
290 295 300 

Glu Glu Pro Phe Ala Ala Leu Ala Phe Lys lie Met Thr Asp Pro Phe 
5 305 310 315 320 

Val Gly Arg Leu Thr Phe Phe Arg Val Tyr Ser Gly Val Leu Gin Ser 
325 330 335 

10 Gly Ser Tyr Val Leu Asn Thr Ser Lys Gly Lys Arg Glu Arg lie Gly 
340 345 350 



15 



Arg lie Leu Gin Met His Ala Asn Ser Arg Gin Glu lie Asp Thr Val 
355 360 365 

Tyr Ser Gly Asp He Ala Ala Ala Val Gly Leu Lys Asp Thr Thr Thr 

370 375 380 



Gly Asp Ser Leu Thr Asp Glu Lys Ala Lys He He Leu Glu Ser He 

20 385 390 395 400 

Asn Val Pro Glu Pro Val He Gin Leu Met Val Glu Pro Lys Ser Lys 

405 410 ' 415 

25 Ala Asp Gin Asp Lys Met Gly He Ala Leu Gin Lys Leu Ala Glu Glu 

420 425 430 



30 



Asp Pro Thr Phe - Arg Val Glu Thr Asn Val Glu Thr Gly Glu Thr Val 
435 440 445 

lie Ser Gly Met Gly Glu Leu His Leu Asp Val Leu Val Asp Arg Met 
. 450 *455 460 



Arg Arg Glu Phe Lys Val Glu Ala Asn Val Gly Ala Pro Gin Val Ser 

35 465 470 475 480 

Tyr Arg Glu Thr Phe Arg Ala Ser Thr Gin Ala Arg Gly Phe Phe Lys 
485 490 495 

40 Arg Gin Ser Gly Gly Lys Gly Gin Phe Gly Asp Val Trp He Glu Phe 
500 505 510 



45 



Thr Pro Asn Glu Glu Gly Lys Gly Phe Glu Phe Glu Asn Ala He Val 

515 520 525 

Gly Gly Val Val Pro Arg Glu Phe He Pro Ala Val Glu Lys Gly Leu 
530 535 540 



Val Glu Ser Met Ala Asn Gly Val Leu Ala Gly Tyr Pro Met Val Asp 
50 545 550 555 560 

Val Lys Ala Lys Leu Tyr Asp Gly Ser Tyr His Asp Val Asp Ser Ser 
565 .570 575 ' 



55 Glu Thr Ala Phe Lys He Ala Ala Ser Leu Ser Leu Lys Glu Ala Ala 
580 585 590 
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Lys Ser Ala Gin Pro Ala lie Leu Glu Pro Met Met Leu Val Thr lie 
595 600 605 

Thr Val Pro Glu Glu Asn Leu Gly Asp Val Met Gly His Val Thr Ala 
610 615 620 

Arg Arg Gly Arg Val Asp Gly Met Glu Ala His Gly Asn Ser Gin He 
625 - 630 635 640 

Val Arg Ala Tyr Val Pro Leu Ala Glu Met Phe Gly Tyr Ala Thr Val 
645 650 655 

Leu Arg Ser Ala Ser Gin Gly Arg Gly Thr Phe Met Met Val Phe Asp 
660 665 670 

His Tyr Glu Asp Val Pro Lys Ser Val Gin Glu Glu He He Lys Lys 
675 680 685 

Asn Lys Gly Glu Asp 
690 



<210> 206 
<211> 408 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 206 

Met Pro Asn Tyr Asn He Pro Phe Ser Pro Pro Asp He Thr Glu Ala 
15 10 15 

Glu lie Ala Glu Val Ala Asp Thr* Leu Arg Ser Gly Trp He Thr Thr 
20 25 30 

Gly Pro Lys Thr Lys Glu Leu Glu Arg Arg Leu Ser Leu Tyr Thr Gin 
35 40 45 

Thr Pro Lys Thr Val Cys Leu Asn Ser Ala Thr Ala Ala Leu Glu Leu 
50 55 60 

He Leu Arg Val Leu Glu Val Gly Pro Gly Asp Glu Val He Val Pro 
65 70 75 80 

Ala Met Thr Tyr Thr Ala Ser Cys Ser Val He Thr His Val Gly Ala 
85 90 95 

Thr Pro Val Met Val Asp He Gin Ala Asp Thr Phe Glu Met Asp Tyr 
100 105 - ■ 110 

Asp Leu Leu Glu Gin Ala He Thr Glu Lys Thr Lys Val He He Pro 
115 120 125 

Val Glu Leu Ala Gly He Val Cys Asp Tyr Asp Arg Leu Phe Gin Val 
130 135 140 

Val Glu Lys Lys . Arg Asp Phe Phe Thr Ala Ser Ser Lys Trp Gin Lys 
145 150 155 160 
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Ala Phe Asn Arg lie Val He Val Ser Asp Ser Ala His Ala Leu Gly 
165 170 175 

Ser Thr Tyr Lys Gly Gin Pro Ser Gly Ser He Ala Asp Phe Thr Ser 
180 185 190 

Phe Ser Phe His Ala Val Lys Asn Phe Thr Thr Ala Glu Gly Gly Ser 
195 200 205 



Ala Thr Trp Lys' Ala Asn Pro Val 
210 215 

Glu Phe Gin He Leu Ser Leu His 
225 230 

Lys Met Gin Leu Gly Ser Trp Glu 
245 

Lys Cys Asn Met Thr Asp He Met 
260 



He Asp Asp Glu Glu Met Tyr Lys 
220 

Gly Gin Thr Lys Asp Ala - Leu Ala 
235 240 

Tyr Asp He Val Thr Pro Ala Tyr 
250 255 

Ala Ser Leu Gly Leu Val Gin Leu 
265 270 



Asp Arg Tyr Pro Ser Leu Leu Gin Arg Arg Lys Asp He Val Asp Arg 
275 280 285 

Tyr Asp Ser Gly Phe Ala Gly Ser Arg He His Pro Leu Ala His Lys 

290 295 300 



Thr Glu Thr Val Glu Ser Ser Arg 
305 310 

Gly Ala Ser Leu Glu Glu Arg Ser 
325 

Ala Gly He Ala Ser Asn Val His 
340 



His Leu Tyr He Thr Arg Val Glu 
315 320 

Leu He He Gin Glu Leu Ala Lys 
330 335 

Tyr Lys Pro Leu Pro Leu Leu Thr 
345 350 



Ala Tyr Lys Asn Leu Gly Phe Asp Met Thr Asn Tyr Pro Lys Ala Tyr 
355 360 . 365 



Ala Phe Phe Glu Asn Glu He Thr Leu Pro Leu His Thr Lys Leu Ser 
370 375 380 



Asp Glu Glu Val Asp Tyr He He Glu Thr Phe Lys Thr Val Ser Glu 
385 390 395 400 

Lys Val Leu Thr Leu Ser Lys Lys 
405 



<210> 207 
<211> 325 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 207 

Met Thr Glu Pro Asp Phe Trp Asn Asp Asn He Ala Ala Gin Lys Thr 
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10 



15 



Ser Gin Glu Leu Asn Val Phe Lys Asn Thr Tyr Asn Thr Phe His Lys 
20 25 30 

5 

Met Glu Glu Leu Gin Asp Glu Val Glu He Leu Leu Asp Phe Leu Ala 
35 40 45 

Glu Asp Glu Ser Val His Asp Glu Leu Val Ala Gin Leu Ala Glu Leu 
10 50 * 55 60 

Asp Lys He Met Thr Ser Tyr Glu Met Thr Leu Leu Leu Ser Glu Pro 
65 70 - 75 80 

15 Tyr Asp His Asn Asn Ala He Leu Glu He His Pro Gly Ser Gly Gly 

85 90 95 



20 



Thr Glu Ala Gin Asp Trp Gly Asp Met Leu . Leu Arg Met Tyr Thr Arg 

100 105 110 

Tyr Gly Asn Ala Lys Gly Phe Lys Val Glu Val Leu Asp Tyr Gin Ala 
115 120 125 



Gly Asp Glu Ala Gly He Lys Ser Val Thr Leu Ser Phe Glu Gly Pro 
25 130 135 140 

Asn Ala Tyr Gly Leu Leu Lys Ser Glu Met Gly Val His Arg Leu Val 
145 150 155 160 

30 Arg He Ser Pro Phe Asp Ser Ala Lys Arg Arg His Thr Ser Phe Thr 

165 170 175 



35 



Ser Val Glu Val Met Pro Glu Leu Asp Asp Thr He Glu Val Glu He 
180 185 190 

Arg Glu Asp Asp He Lys Met Asp Thr Phe Arg Ser Gly Gly Ala Gly 
195 200 205 



Gly Gin Asn Val Asn Lys Val Ser Thr Gly Val Arg Leu Thr His He 
40 210 215 220 

Pro Thr Gly He Val Val Gin Ser Thr Val Asp Arg Thr Gin Tyr Gly 
225 230 235 240 

45 Asn Arg Asp Arg Ala Met Lys Met Leu Gin Ala Lys Leu Tyr Gin Met 

245 250 255 



50 



Glu Gin Glu Lys Lys Ala Ala Glu Val- Asp Ser Leu Lys Gly Glu Lys 

260 265 270 

Lys Glu He Thr Trp Gly Ser Gin He Arg Ser Tyr Val Phe Thr Pro 

275 280 285 



Tyr Thr Met Val Lys Asp His Arg Thr Ser Phe Glu Val Ala Gin Val 
55 290 295 300 



Asp Lys Val Met Asp Gly Asp Leu Asp Gly Phe He Asp Ala Tyr Leu 
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305 



310 



Lys Trp Arg lie Ser 
325 



315 



320 



<210> 208 
<211> 249 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 208 

Met Phe Tyr Thr Tyr Leu Arg Gly Leu Val Val Leu Leu Leu Trp Ser 
1 5 10 15 

lie Asn Gly Asn Ala His Tyr His Asn Thr Asp Lys He Pro Asn Gin 
20 25 30 

Asp Glu Asn Tyr He Leu Val Ala Pro His Arg Thr Trp Trp Asp Pro 
35 40 45 

Val Tyr Met Ala Phe Ala Thr Lys Pro Lys Gin Phe He Phe Met Ala 
50 55 60 

Lys Lys Glu Leu Phe Thr Asn Arg He Phe Gly Trp Trp He Arg Met 
65 70 75 80 

Cys Gly Ala Phe Pro He Asp Arg Glu Asn Pro Ser Ala Ser Ala He 
85 90 95 

Lys Tyr Pro He Asn Val Leu Lys Lys Ser Asp Arg Ser Leu He Met 
100 105 110 

Phe Pro Ser Gly Ser Arg His Ser Asn Asp Val Lys Gly Gly Ala Ala 
115 " 120 125 

Leu He Ala Lys Met Ala Lys Val Arg He Met Pro Val Thr Tyr Thr 
130 135 140 

Gly Pro Met Thr Leu Lys Gly Leu He Ser Arg Glu Arg Val Asp Met 
145 150 155 160 

Asn Phe Gly Asn Pro He Asp He Ser Asp He Lys Lys Met Asn Asp 
165 170 .175 

Glu Gly He Glu Thr Val Ala Asn Arg He Gin Thr Glu Phe Gin Arg 
180 185 190 

Leu Asp Glu Glu Thr Lys Gin Trp His Asn Asp Lys Lys Pro Asn Pro 
195 200 205 

Leu Trp Trp Phe He Arg He Pro Ala Leu lie Leu Ala He He Leu 
210 215 220 

Ala He Leu Thr He He Phe Ser Phe He Ala Ser Phe He Trp Asn 
225 230 235 240 
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Pro Asp Lys Lys Arg Glu Glu Leu Ala 
245 



<210> 209 
<211> 1033 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 209 

Met lie Ala Gin Leu Asp Thr Lys Thr Val Tyr Ser Phe Met Glu Ser 
15 10 15 

Val He Ser He Glu Lys Tyr Val Arg Ala Ala Lys Glu Tyr Gly Tyr 
20 25 30 

Thr His Leu Ala Met Met Asp He Asp Asn Leu Tyr Gly Ala Phe Asp 
35 40 45 

Phe Leu Glu He Thr Lys Lys Tyr Gly He. His Pro Leu Leu Gly Leu 
50 55 60 

Glu Met Thr Val Phe Val Asp Asp Gin Gly Val Asn Leu Arg Phe Leu 
65 70 75 80 

Ala Leu Ser Ser Val Gly Tyr Gin Gin Leu Met Lys Leu Ser Thr Ala 
85 90 95 

Lys Met Gin Gly Glu Lys Thr Trp Ser Val Leu Ser Gin Tyr Leu Glu 
. 100 105 110 

Asp He Ala Val He Val Pro Tyr Phe Asp Arg Val Glu Ser Leu Glu 
115 120 125 

Leu Gly Cys Asp Tyr Tyr He Gly Val Tyr Pro Glu Thr Leu Ala Ser 
130 135 140 

Glu Phe His His Pro He Leu Pro Leu Tyr Arg Val Asn Ala Phe Glu 
145 150 ' 155 160 

Ser Arg Asp Arg Glu Val Leu Gin Val Leu Thr Ala He Lys Glu Asn 
165 170 175 

Leu Pro Leu Arg Glu Val Pro Leu Arg Ser Arg Glh Asp Val Phe He 
180 185 190 

Ser Ala Ser Ser Leu Glu Lys Leu Phe Gin Glu Arg Phe Pro Gin Ala 
195 200 205 

Leu Asp Asn Leu Glu Lys Leu He Ser Gly He Ser Tyr Asp Leu Asp 
210 215 220 

Thr Ser Leu Lys Leu Pro Arg Phe Asn Pro Ala Arg Pro Ala Val Glu 
225 230 235 240 

Glu Leu Arg Glu Arg Ala Glu Leu Gly Leu Val Gin Lys Gly Leu Thr 
■ • 245 250 255 
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Ser Lys Glu Tyr Gin Asp Arg Leu Asp Gin Glu Leu Ser Val lie His 
260 265 270 

Asp Met Gly Phe Asp Asp Tyr Phe Leu Val Val Trp Asp Leu Leu Arg 

275 280 285 



10 



Phe Gly Arg Ser Asn Gly Tyr Tyr Met Gly Met Gly Arg Gly Ser Ala 

290 295 300 

Val Gly Ser Leu Val Ser Tyr Ala Leu Asp lie Thr Gly lie Asp Pro 
305 310 315 320 



Val Glu Lys Asn Leii lie Phe Glu Arg Phe Leu Asn Arg Glu Arg Tyr 
15 325 330 335 

Thr Met Pro Asp lie Asp lie Asp lie Pro Asp lie Tyr Arg Pro Asp 
340 345 350 

20 Phe lie Arg Tyr Val Gly Asn Lys Tyr Gly Ser Lys His Ala Ala Gin 
355 360 365 



-25 



He Val Thr Phe Ser Thr Phe Gly Ala Lys Gin Ala Leu Arg Asp Val 
370 . 375 380 

Leu Lys Arg Phe Gly Val Pro Glu Tyr Glu Leu Ser Ala He Thr Lys 
385 • 390 395 400 



' Lys He Ser Phe Arg Asp Asn Leu Lys Ser Ala Tyr Glu Gly Asn Leu 
30 405 410 415 

'4 

Gin Phe Arg Gin Gin He Asn Ser Lys Leu Glu Tyr Gin Lys Ala Phe 
420 425 430 

35 Glu' He Ala Cys Lys He Glu Gly Tyr Pro Arg Gin Thr Ser Val His 
435 440 445 



40 



Ala Ala Gly Val Val He Ser Asp Gin Asp Leu Thr Asn Tyr He Pro 
450 455 460 

Leu Lys Tyr Gly Asp Glu He Pro Leu Thr Gin Tyr Asp Ala His Gly 
465 470 ' 475 480 



Val Glu Ala Ser Gly Leu Leu Lys Met Asp Phe Leu Gly Leu Arg Asn 
45 485 490 495 

Leu Thr Phe Val Gin Lys Met Gin Glu Leu Leu Ala Glu He Glu Gly 
500 505- ■ 510 

50 He His Leu Lys He Glu Glu lie Asp Leu Glu Asp Lys Glu Thr Leu 
515 520 525 



55 



Asp Leu Phe Ala Ser Gly Asn Thr Lys Gly He Phe Gin Phe Glu Gin 

530 535 540 

Pro Gly Ala He Arg Leu Leu Lys Arg. Val Gin Pro Val Cys Phe Glu 

545 550 555 560 
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Asp Val Val Ala Thr Thr Ser Leu Asn Arg Pro Gly Ala Ser Asp Tyr 
565 570 575 

He Asn Asn Phe Val Ala Arg Lys His Gly Gin Glu Glu Val Thr Val 
580 585 590 



10 



Leu Asp Pro Val Leu Glu Asp He Leu Ala Pro Thr Tyr Gly He Met 
595 600 605 

Leu Tyr Gin Glu Gin Val Met Gin Val Ala Gin Arg Phe Ala Gly Phe 
610 615 620 



Ser Leu Gly Lys Ala Asp He Leu Arg Arg Ala Met Gly Lys Lys Asp 
15 625 630 635 640 

Ala Ser Ala Met His Glu Met Arg Ala Ser Phe He Gin Gly Ser Leu 

645 650 655 

20 Glu Ala Gly His Thr Val Glu Lys Ala Glu Gin Val Phe Asp Val Met 
660 665 670 



25 



Glu Lys Phe Ala Gly Tyr Gly Phe Asn Arg Ser His Ala Tyr Ala Tyr 
675 ' 680 685 

Ser Ala Leu Ala Phe Gin Leu Ala Tyr Phe Lys Thr His Tyr Pro Ala 
690 695 700 



He Phe Tyr Gin He Met Leu Asn Ser Ala Asn Ser Asp Tyr Leu He 
30 705 710 715 720 

Asp Ala Leu Glu Ala Gly Phe Glu Val Ala Pro Leu Ser He Asn Thr 
725 730 735 

35 He Pro Tyr His* Asp Lys He Ala Asn Lys Ala He Tyr Leu Gly Leu 

740 745 750 



40 



Lys Ser He Lys Gly Val Ser Asn Asp. Leu Ala Leu Trp lie He Glu 
755 760 765 

His Arg Pro Tyr Ser Asn He Glu Asp Phe He Ala Lys Leu Pro Glu 
770 775 780 



Asn Tyr Leu Lys Leu Pro Leu Leu Glu Pro Leu Val Lys Val Gly Leu 
45 785 790 795 800 

Phe Asp Ser Phe Glu Lys Asn Arg Gin Lys Val Phe Asn Asn Leu Ala 
805 "810 815 

50 Asn Leu Phe Glu Phe Val Lys Glu Leu Gly Ser Leu Phe Gly Asp Ala 
820 825 830 



55 



He Tyr Ser Trp Gin Glu Ser Glu Asp Trp Thr Glu Gin Glu Lys Phe 
835 840 845 

Tyr Met Glu Gin Glu Leu Leu Gly He Gly Val Ser Lys His Pro Leu 
850 855 860 
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Gin Ala He Ala Ser Lys Ala He Tyr Pro He Thr Pro He Gly Asn 
865 870 875 880 

5 Leu Ser Glu Asn Ser Tyr Ala lie He Leu Val Glu Val Gin Lys He 

885 890 - 895 

Lys Val He Arg Thr Lys Lys Gly Glu Asn Met Ala Phe Leu Gin Ala 
900 905 910 

10 

Asp Asp Ser Lys Lys Lys Leu Asp Val Thr Leu Phe Ser Asp- Leu Tyr 
915 920 925 

Arg Gin Val Gly Gin Glu He Lys Glu Gly Ala Phe Tyr Tyr Val Lys 
15 930 935 940 

Gly Lys He Gin Ser Arg Asp Gly Arg Leu Gin Met He Ala Gin Glu 
945 950 955 960 

20 He Arg Glu Ala Val Ala Glu Arg Phe Trp He Gin Val Lys Asn His 

965 ' 970 975 



25 



Glu' Ser Asp Gin' Glu He Ser Arg He Leu Glu Gin Phe Lys Gly Pro 
980 985 990 

He Pro Val He He Arg Tyr Glu Glu Glu Gin Lys Thr He Val Ser 
995 1000 1005 



Pro His His Phe Val Ala Lys Ser Asn Glu Leu Glu Glu Lys Leu Asn 
30 1010 . 1015 1020 



35 



40 



Glu He Val Met Lys Thr He Tyr Arg 
1025 1030 



<210> 210 
<211> 306 
<212> PRT 

<213> Streptococcus pneumoniae 



• <400> 210 

Met Thr Asn Glu Phe Leu His Phe Glu Lys He Ser Arg Gin Thr Trp 
1 5 " 10 15 

45 Gin Ser Leu His Arg Lys Thr Thr Pro Pro Leu Thr Glu Glu Glu Leu 
20 25 30 

Glu Ser He Lys Ser Phe Asn Asp Gin lie Ser Leu Gin Asp Val Thr 
35 40 45 

50 

Asp He Tyr Leu Pro Leu Ala His Leu He Gin He Tyr Lys Arg. Thr 
50 55 60 

Lys Glu Asp Leu Ala Phe Ser Lys Gly He Phe Leu Gin Arg Glu Ser 
55 65 70 75 80 

Lys Ser Gin Pro Phe He He Gly Val Ser Gly Ser Val Ala Val Gly 
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85 



90 



95 



Lys Ser Thr Thr Ser Arg Leu Leu Gin lie Leu Leu Ser Arg Thr Phe 
100 105 110 ' 

Thr Asp Ala Thr Val Glu Leu Val Thr Thr Asp Gly Phe Leu Tyr Pro 
115 120 125 

Asn Gin Thr Leu lie Glu Gin Gly He Leu Asn Arg Lys Gly- Phe Pro 
130 135 ' 140 

Glu Ser Tyr Asp Met Glu Ala Leu Leu Asn Phe Leu Asp Arg He Lys 
145 150 155 160 

Asn Gly Gin Asp Val Asp He Pro Val Tyr Ser His Glu Val Tyr Asp 
165 170 175 

He Val Pro Lys Lys Lys Gin Ser Val Lys Ala Ala Asp Phe Val He 
180 185 190 

Val Glu Gly He Asn Val Phe Gin Asn Pro Gin Asn Asp Arg Leu Tyr 
195 200 205 

He Thr Asp Phe Phe Asp Phe Ser He Tyr Val Asp Ala Gly Val Asp 
210 215 220 

Asp He Glu Ser Trp Tyr Leu Asp Arg Phe Leu Lys Met Leu Ser Leu 
225 230 235 240 

Ala Gin' Asn Asp Pro Asp Ser Tyr Tyr Tyr Arg Phe Thr Gin Met Pro 
245 250 255 

He Gly Glu Val Glu Ser Phe Ala His Gin Val Trp Thr Ser He Asn 
260 265 270 

Leu Thr Asn Leu Gin Asn Tyr He Glu Pro Thr Arg Asn Arg Ala Glu 
- 275 280 285 

Val He Leu His Lys Ser Lys Asn His Glu He Asp Glu He Tyr Leu 
290 295 300 

Lys Lys 
305 



<210> 211 
<211> 246 
<212> PET 

<213> Streptococcus pneumoniae 
<400> 211 

Met Glu He Ser Leu Leu Thr Asp Val Gly Gin Lys Arg Thr Asn Asn 
1 5 10 15 - 

Gin Asp Tyr Val Asn His Tyr Val Asn Arg Ala Gly Arg Thr Met He 



20 



25 



30 
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He Leu Ala Asp Gly Met Gly Gly His Arg Ala Gly Asn He Ala Ser 
35 40 45 

Glu -Met Ala Val Thr Asp Leu Gly Val Ala Trp Val Asp Thr Gin He 
5 50 55 . 60 

Asp Thr Val Asn Glu Val Arg Glu Trp Phe Ala His Tyr Leu Glu He 
65 70 75 80 

10 Glu Asn Gin Lys He His Gin Leu Gly Gin Asp Glu Ala Tyr Arg Gly 

85 90 95 



15 



Met Gly Thr Thr Leu Glu Val Leu Ala He He Asp Asn Gin Ala lie 
100 105 110 

Tyr Ala His He Gly Asp Ser Arg He Gly Leu He Arg Gly Glu Glu 
115 120 125 



Tyr His Gin Leu Thr Ser Asp His Ser Leu Val Asn Glu Leu Leu Lys 
20 130 135 140 

Ala Gly Gin Leu Thr Pro. Glu Glu Ala Glu Ala His Pro Gin Lys Asn 
145 150 155 160 

25 He He Thr Gin Ser He Gly Gin Lys Asp Glu He Gin Pro Asp Phe 

165 170 175 



30 



Gly Thr Val He Leu Glu Ser Gly Asp Tyr- Leu Leu Leu Asn Ser Asp 
180 185 190 

Gly Leu Thr Asn Met He Ser. Gly Ser Glu He Arg Asp He Val Thr 
195 200 205 



Ser Asp He Pro Leu Ala Asp Lys Thr Glu Thr Leu Val Arg Phe Ala 
35 210 215 220 

Asn Asn Ala Gly Gly Leu Asp Asn He Thr Val Ala Leu Val Ser Met 

225 230 235 240 



40 Asn Glu Glu Asp Glu Glu 

245 



<210> 212 
45 <211> 276 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 212 

50 Met Thr He Gin Met Lys Asn Thr Gly Lys Arg He Asp Leu He Ala 
1 5 10 15 

Asn Arg Lys Pro Gin Ser Gin Arg Val Leu Tyr Glu Leu Arg Asp Arg 
20 25 30 



55 



Leu Lys Arg Asn Gin Phe He Leu Asn Asp Thr Asn Pro Asp He Val 
35 40 45 
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lie Ser lie Gly Gly Asp Gly Met' Leu Leu Ser Ala Phe His Lys Tyr 
50 55 60 

5 Glu Asn Gin Leu Asp* Lys Val Arg Phe He Gly Leu His Thr Gly His 
65 70 75 80 

Leu Gly Phe Tyr Thr Asp Tyr Arg Asp' Phe Glu Leu Asp Lys Leu Val 
85 90 95 

10 

Thr Asn Leu Gin Leu Asp Thr Gly Ala Arg Val Ser Tyr Pro Val Leu 
100 105 110 

Asn Val Lys Val Phe Leu Glu Asn Gly Glu Val Lys He Phe Arg Ala 
15 115 120 125 

Leu Asn Glu Ala Ser He Arg Arg Ser Asp Arg. Thr Met Val Ala Asp 
130 135 140 

20 He Val He Asn Gly Val Pro Phe Glu Arg Phe Arg Gly Asp Gly Leu 
145 150 155 160 

Thr Val Ser Thr Pro Thr Gly Ser Thr Ala Tyr Asn Lys Ser Leu Gly 
165 170 175 

25' 

Gly Ala Val Leu His Pro Thr He Glu Ala Leu Gin Leu Thr Glu He 
180 185 190 

Ala Ser Leu Asn Asn Arg Val Tyr Arg Thr Leu Gly Ser Ser He He 
30 195 200 205 

Val Pro Lys Lys Asp Lys He Giii Leu He Pro Thr Arg Asn Asp Tyr 
210 - 215 220 

35 His Thr He Ser Val Asp Asn Ser Val Tyr Ser Phe Arg Asn He Glu 
225 230 235 240 

Arg He Glu Tyr Gin He Asp His His Lys He His Phe Val Ala Thr 
245 250 255 

40 

Pro Ser His Thr Ser Phe Trp Asn Arg Val Lys Asp Ala Phe He Gly 
260 265 270 ■ 

Glu Val Asp Glu 
45 275 



<210> 213 
<211> 540 
50 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 213 

Met Ser Lys Glu He Lys Phe Ser Ser Asp Ala Arg Ser Ala Met Val 
55 1 5 10 15 

Arg Gly Val Asp He Leu Ala Asp Thr Val Lys Val Thr Leu Gly Pro 
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20 



25 



30 



Lys Gly Arg Asn Val Val Leu Glu Lys Ser Phe Gly Ser Pro Leu lie 
35 40 45 

Thr Asn Asp Gly Val Thr lie Ala Lys Glu lie Glu Leu Glu Asp His 
50 55 60 

Phe Glu Asn Met Gly Ala Lys Leu Val Ser Glu Val Ala Ser Lys Thr 
65 70 75 80 

* Asn Asp lie Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Thr Gin 
85 90 95 

Ala lie Val Arg Glu Gly lie Lys Asn Val Thr Ala Gly Ala Asn Pro 
100 105 110 

He Gly He Arg Arg Gly He Glu Thr Ala Val Ala Ala Ala Val Glu 
115 120 125 

Ala Leu Lys Asn Asn Ala He Pro Val Ala Asn Lys Glu Ala He Ala 
130 135 140 

Gin Val Ala Ala Val Ser Ser Arg Ser Glu Lys Val Gly Glu Tyr He 
145 150 155 160 

Ser Glu Ala Met Glu Lys Val Gly Lys Asp Gly Val lie Thr He Glu 
165 170 . 175 

Glu Ser Arg Gly Met Glu Thr Glu Leu Glu Val Val Glu Gly Met Gin 
180 185 - 190 

Phe Asp Arg Gly Tyr Leu Ser Gin Tyr Met Val Thr Asp Ser Glu Lys 
195 200 205 

Met Val' Ala Asp Leu Glu Asn Pro Tyr He Leu He Thr Asp Lys Lys 
210 215 220 

lie Ser Asn He Gin Glu lie Leu Pro Leu Leu Glu Ser He Leu Gin 
225 230 235 240 

Ser Asn Arg Pro Leu Leu He He Ala Asp Asp Val Asp Gly Glu Ala 
245 250 255 

Leu Pro Thr Leu Val Leu Asn Lys He Arg Gly Thr. Phe Asn Val Val 
260 265 270 

Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Glu 
275 280 285 

Asp He Ala He Leu Thr Gly Gly Thr Val He Thr Glu Asp Leu Gly 
290 295 300 

Leu Glu Leu Lys Asp Ala Thr He Glu Ala Leu Gly Gin Ala Ala Arg 
305 310 315 320 



Val Thr Val Asp Lys Asp Ser Thr Val He Val Glu Gly Ala Gly Asn 
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325 330 335 

Pro Glu Ala lie Ser His Arg Val Ala Val He Lys Ser Gin He Glu 
340 345 350 

Thr. Thr Thr Ser Glu Phe Asp Arg Glu Lys Leu Gin Glu Arg Leu Ala 
355 360 365 

Lys Leu Ser Gly Gly Val Ala Val He Lys Val Gly Ala Ala Thr Glu 
370 375 380 

Thr Glu Leu Lys Glu Met Lys Leu Arg He Glu Asp Ala Leu Asn Ala 
385- * 390 -3-95 400 

Thr Arg Ala Ala Val Glu Glu Gly He Val Ala Gly Gly Gly Thr Ala 
405 410 415 

Leu Ala Asn Val He Pro Ala Val Ala Thr Leu Glu Leu Thr Gly Asp 
420 425 430 

Glu Ala Thr Gly Arg Asn He Val Leu Arg Ala Leu Glu Glu Pro Val 
435 440 445 

Arg Gin He Ala His Asn Ala Gly Phe Glu Gly Ser lie Val He Asp 
450 455 460 

Arg Leu Lys Asn Ala Glu Leu Gly He Gly Phe Asn Ala Ala Thr Gly 
465 470 475 480 

Glu Trp Val Asn Met He Asp Gin Gly He He Asp Pro Val Lys Val 
485 490 . 495 

Ser Arg Ser Ala Leu Gin Asn Ala Ala Ser Val Ala Ser Leu lie Leu 
• 500 505 510 

Thr Thr Glu Ala Val Val Ala Asn Lys Pro Glu Pro Val Ala Pro Ala 
515 520 525 * 

Pro Ala Met Asp Pro Ser Met Met Gly Gly Met Met 
530 535 540 



<210> 214 

<211> 481 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 214 

Met lie Lys He Glu Thr Val Leu Asp- He Leu Lys Lys Asp Gly Leu. 
1 5 10 15 

Phe Arg Glu He He Asp Gin Gly- His Tyr His Tyr Asn Tyr Ser Lys 
20 25 30 

Val He Phe Asp Ser lie Ser Tyr Asp Ser Arg Lys Val Thr Glu Asp 
35 40 45 
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Thr Leu Phe Phe Ala Lys Gly Ala Ala Phe Lys Lys Glu Tyr Leu Leu 
50 55 60 

Ser Ala He Thr Gin Gly Leu Ala Trp Tyr Val Ala Glu Lys Asp Tyr 
5 65 70 75 80 

Glu Val Gly He Pro Val He He Val Asn Asp He Lys Lys Ala Met 
85 90 95 

10 Ser Leu He Ala Met Glu Phe Tyr Gly Asn Pro Gin Glu Lys Leu Lys 
100 105 110 



15 



Leu Leu Ala Phe Thr Gly Thr Lys Gly Lys Thr Thr Ala Ala Tyr Phe 
115 120 125 

Ala Tyr Asn He Leu Ser Gin Gly His Arg Pro Ala Met Leu Ser Thr 
130 135 140 



Met Asn Thr Thr Leu Asp Gly Glu Thr Phe Phe Lys Ser Ala Leu Thr 
20 145 150 155 160 

Thr Pro Glu Ser He Asp Leu Phe Asp Met Met Asn Gin Ala Val Gin 
165 170 175 

25 .Asn Asp Arg Thr His Leu He Met Glu Val Ser Ser Gin Ala Tyr Leu 
180 185 190 



30 



Val Lys Arg Val' Tyr Gly Leu Thr Phe Asp Val Gly Val Phe Leu Asn 
195 200 205 

He* Ser Pro Asp His He Gly Pro He Glu His Pro Ser Phe Glu Asp 
210 215 220 



Tyr Phe Tyr His Lys Arg Leu Leu Met Glu Lys Ser Arg Ala Val lie 
35 225 230 235 240 

He Asn Ser Asp Met Asp His Phe Ser Val Leu Lys Glu Gin Val Glu 
245 250 255 

40 Asp Gin Asp His Asp Phe Tyr Gly Ser- Gin Phe Asp Asn Gin He Glu 
260 265 270 



45 



Asn Ser Lys Ala Phe Ser Phe Se.r Ala Thr Gly Lys Leu Ala Gly Asp 
275 280 285 

Tyr Asp -lie Gin Leu He Gly Asn Phe Asn Gin Glu Asn Ala Val Ala 
. 290 295 300 



Ala Gly Leu Ala Cys Leu Arg Leu Gly Ala Ser Leu Glu Asp He Lys 

50 305 310 315 320 

Lys Gly He Ala Ala Thr Arg Val Pro Gly Arg Met Glu Val Leu Thr 
325 330" 335 . 



55 ' Gin Lys Asn Gly Ala Lys Val Phe He Asp Tyr Ala His Asn Gly Asp 
340 345 350 
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Ser Leu Lys Lys Leu He Asn Val Val Glu Thr His Gin Thr Gly Lys 
355 360 365 

He Ala Leu Val Leu Gly Ser Thr Gly Asn Lys Gly Glu Ser Arg Arg 
5 370 375 380 

Lys Asp Phe Gly Leu Leu Leu Asn Gin His Pro Glu lie Gin Val Phe 
385 390 395 400 

10 Leu Thr Ala Asp Asp Pro Asn Tyr Glu Asp Pro Met Ala He Ala Asp 

405 410 415 



15 



Glu He Ser Ser Tyr He Asn His Pro Val Glu Lys lie Ala Asp Arg 
420 425 430 

Gin Glu Ala He Lys Ala Ala Met Ala He Thr ' Asn His Glu Leu Asp 
435 440 445 



Ala Val He He Ala Gly Lys Gly Ala Asp Cys Tyr Gin He He Gin 
20 450 455 460 

Gly Lys Lys Glu Ser Tyr Pro Gly Asp Thr Ala Val Ala Glu Asn Tyr 

465 470 475 480 



25 Leu 



<210> 215 
30 <211> 659 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 215 

35 Met He Gin He Gly Lys lie Phe Ala Gly Arg Tyr Arg He Val Lys 
1 5 10 15 

Gin He Gly Arg Gly Gly Met Ala Asp Val Tyr Leu Ala Lys Asp Leu 
20 25 30 



40 



He Leu Asp Gly Glu Glu Val Ala Val Lys Val Leu Arg Thr Asn Tyr 
35 40 45 



Gin Thr Asp Pro He Ala Val Ala Arg Phe Gin Arg Glu Ala Arg Ala 
45 50 55 60 

Met Ala Asp Leu Asp His Pro His lie Val Arg He Thr Asp He Gly 
65 70 75 80 

50 Glu Glu Asp Gly Gin Gin Tyr Leu Ala Met Glu Tyr Val Ala Gly Leu 

85 90 95 . 

Asp Leu Lys Arg Tyr He Lys Glu His Tyr Pro Leu Ser Asn Glu Glu 
100 105 HO 

55 

Ala Val Arg He Met Gly Gin He Leu Leu Ala Met Arg Leu Ala His 
115 120 125 
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10 



Thr Arg Gly lie Val His Arg Asp Leu Lys Pro Gin Asn lie Leu Leu 
-130 135 140 

Thr Pro Asp Gly Thr Ala Lys Val Thr Asp Phe Gly He Ala Val Ala 
145- 150 155 160 

Phe Ala Glu Thr Ser Leu Thr Gin Thr Asn Ser Met Leu Gly Ser Val 
165 170 175 

His Tyr Leu Ser Pro Glu Gin Ala Arg Gly. Ser Lys Ala Thr Val Gin 
180 185 190 



Ser Asp He Tyr Ala Met Gly He He Phe Tyr Glu Met Leu Thr Gly 
15 195 200 205 

His He Pro Tyr Asp Gly Asp Ser Ala Val Thr He Ala Leu Gin His 
210 215 220 

20 Phe Gin Lys Pro Leu Pro Ser Val He Ala Glu Asn Pro Ser Val Pro 
225 230 235 240 



25 



Gin Ala Leu Glu Asn Val He He Lys Ala Thr Ala Lys Lys Leu Thr 
245 250 255 

Asn Arg Tyr Arg Ser Val Ser Glu Met Tyr Val Asp Leu Ser Ser Ser 
260 265 270 



Leu Ser Tyr Asn Arg Arg Asn Glu Ser Lys Leu lie Phe Asp Glu Thr 
30 275 280 285 

Ser Lys Ala Asp Thr Lys Thr Leu Pro Lys Val Ser Gin Ser Thr Leu 

290 295 300 

35 Thr Ser He Pro Lys Val Gin- Ala Gin Thr Glu His Lys Ser He Lys 

305 310 315 320 



40 



Asn Pro Ser Gin Ala. Val Thr Glu Glu Thr Tyr Gin Pro Gin Ala Pro 
325 330 335 

Lys Lys His Arg Phe Lys Met Arg Tyr Leu He Leu Leu Ala Ser Leu 
340 345 350 



Val Leu Val Ala Ala Ser Leu He Trp He Leu Ser Arg Thr Pro Ala 
45 355 . 360 365 

Thr He Ala He Pro Asp Val Ala Gly Gin Thr Val Ala Glu Ala Lys 
370 ' 375 " 380 

50 Ala Thr Leu Lys Lys Ala. Asn Phe Glu He Gly Glu Glu Lys Thr Glu 
385 390 395 400 



55 



Ala Ser Glu Lys Val Glu Glu ,Gly Arg He He Arg Thr Asp Pro Gly 
405 410 415 

Ala Gly Thr Gly Arg Lys Glu Gly Thr Lys He Asn Leu Val Val Ser 
420 425 * 430 
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Ser Gly Lys Gin Ser Phe Gin He Ser Asn Tyr Val Gly Arg Lys Ser 
435 440 445 

Ser Asp Val He Ala Glu Leu Lys Glu Lys Lys Val Pro Asp Asn Leu 
450 455 460 

He Lys He Glu Glu Glu Glu Ser Asn Glu Ser Glu Ala Gly Thr Val 
465 470 475 480 

Leu Lys Gin Ser Leu Pro Glu Gly Thr Thr Tyr Asp Leu Ser Lys Ala 
485 490 495 

Thr Gin He Val Leu Thr Val Ala Lys Lys Ala Thr Thr He Gin Leu 
500 505 510 

Gly Asn Tyr He Gly Arg Asn Ser Thr Glu Val He Ser Glu Leu Lys 
515 520 525 

Gin Lys Lys Val Pro Glu Asn Leu He Lys He Glu Glu Glu Glu Ser 
530 535 540 

Ser Glu Ser Glu Pro Gly Thr He Met Lys Gin Ser Pro Gly Ala Gly 
545 550 555 560 

Thr Thr Tyr Asp Val Ser Lys Pro Thr Gin He Val Leu Thr Val Ala 
565 570 575 

Lys Lys Val Thr Ser Val Ala Met Pro Ser Tyr He Gly Ser Ser Leu 
580 585 590 . * 

Glu Phe Thr Lys Asn Asn Leu He Gin lie Val Gly He Lys Glu Ala 
595 600 605 

Asn He Glu Val Val Glu Val Thr Thr Ala Pro Ala Gly Ser Ala Glu 
610 615 620 

Gly Met Val Val Glu Gin Ser Pro Arg Ala Gly Glu Lys Val Asp Leu 
625 630 635 640 

Asn Lys Thr Arg Val Lys He Ser He Tyr Lys Pro Lys Thr Thr Ser 
645 650 655 



Ala Thr Pro 



<210> 216 
<211> 391 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 216 

Met Lys His Phe Asp Thr He Val He Gly Gly Gly Pro Ala Gly Met 
15 10 15 

Met Ala Thr He Ser Ser Asn Phe Tyr Gly Gin Lys Thr Leu Leu He 
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20 



25 



30 



10 



Glu Lys Asn Arg Lys Leu Gly Lys Lys Leu Ala Gly Thr Gly Gly Gly 
35, 40 45 

Arg Cys Asn Val Thr Asn Asn Gly Ser Leu Asp Asn Leu Leu Ala Gly 
50 55 60 

lie Pro Gly Asn Gly Arg Phe Leu Tyr Ser Val Phe Ser Gin Phe Asp 
65 70 75 80 

Asn His Asp lie lie Asn Phe Phe Thr Glu Asn Gly Val Lys Leu Lys 
85 90 95 



15 Val Glu Asp His Gly Arg Val Phe Pro Ala Ser Asp Lys Ser Arg Thr 
100 105 110 



20 



25 



He He Glu Ala Leu Glu Lys Lys He Thr Glu Leu Gly Gly Gin Val 
115 120 125 

Ala Thr Gin He Glu He Val Ser Val Lys Lys Val Asp Asp Gin Phe 
130 135 140 

Val Leu Lys Ser Ala Asp Gin Thr Phe Thr Cys Glu Lys Leu He Val 
145 150 155 160 

Thr Thr Gly Gly Lys Ser Tyr Pro Ser Thr Gly Ser Thr Gly Phe Gly 
165 170 175 



30 His Glu He Ala Arg His Phe Lys His Thr lie Thr Asp Leu Glu Ala 
180 185 190 



35 



40 



Ala Glu Ser Pro Leu Leu Thr Asp Phe Pro His Lys Ala Leu Gin Gly 
195 200 205 

He Ser Leu Asp Asp Val Thr Leu Ser Tyr Gly Lys His Val He Thr 
210 215 220 

His Asp Leu Leu Phe Thr His Phe Gly Leu Ser Gly Pro Ala Ala Leu 
225 230 235 240 

Arg Met Ser Ser Phe Val Lys Gly Gly Glu Val Leu Ser Leu Asp Val 
245- 250 255 



45 



50 



55 



Leu Pro Gin Leu Ser Glu Lys Asp Leu Val Thr Phe Leu Glu Glu Asn 
260 265 270 

Arg Glu Lys Ser Leu Lys Asn Ala Leu Lys Thr Leu Leu Pro Glu Arg 
275 280 285 

Leu Ala Glu Phe Phe Val Gin Gly Tyr Pro Glu Lys Val Lys Gin Leu 
290 295 300 

Thr Glu Lys Glu Arg Glu Gin Leu Val Gin Ser He Lys Glu Leu Lys 
305 310 315 320 

He Pro Val Thr Gly Lys Met Ser Leu Ala Lys Ser Phe Val Thr Lys 
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325 330 335 

Gly Gly Val Ser Leu Lys Glu lie Asn Pro Lys Thr Leu Glu Ser Lys 
340 ' 345 350 

Leu Vai Pro Gly Leu His Phe Ala Gly Glu Val Met Asp lie Asn Ala 
355 360 365 

His Thr Gly Gly Phe Asn He Thr Ser Ala Leu Cys Thr Gly Trp Val 
370 375 380 

Ala Gly Ser Leu His Tyr Asp 
385 390 



<210> 217 

<211> 231 

<2'12> PRT 

<213> Streptococcus pneumoniae 

<400> 217 

Met Leu Lys Trp Glu Asp Leu Pro Val Glu Met Lys Ser Ser Glu Val 
1 5 10 *15 

Glu Ser Tyr Tyr Gin Leu Val Ser Lys Arg Lys Gly Ser Leu He Phe 
20 25 30 

Lys Arg Cys Leu Asp Trp Val Leu Ala Leu Vai Leu Thr Trp Val Leu 
35 40 45 

Thr Ser Pro He Phe Leu He Leu Ser He Trp He Lys Leu Asp Ser 
50 55 60 

Lys Gly Pro Val He Tyr Lys Gin Glu Arg Val Thr Gin Tyr Asn Arg 
65 70 75 80 

Arg Phe Lys He Trp Lys Phe Arg Thr Met Val Thr Asp Ala Asp Lys 
85 90 95 

Lys Gly Ser Leu Val Thr Ser Ala Asn Asp Ser Arg He -Thr Lys Val 
100 ■ 105 110 

Gly Asn Phe He Arg Arg Val Arg Leu Asp Glu Leu Pro Gin Leu Val 
115 120 125 

Asn Val Leu Lys Gly Glu Met Ser Phe Val Gly Thr Arg Pro Glu Val 
130 135 140 

Pro Arg Tyr Thr Glu Gin Tyr Ser Pro Glu Met Met Ala Thr Leu Leu 
145 150 155 160 

Leu Gin Ala Gly He Thr Ser Pro Ala Ser He Asn Tyr Lys Asp Glu 
165 ■ 170 175 

Asp Thr He He Ser Gin Met Thr Glu Lys Gly Leu Ser Val Asp Gin 
180 185 190 
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Ala Tyr Val Glu His Val Leu Pro Glu Lys Met Arg Tyr Asn Leu Ala 
195 200 205 

Tyr Leu Arg Glu Phe Ser Phe Phe Gly Asp He Lys He Met Phe Gin 
210 215 220 

Thr Val Phe Glu Val Leu Lys 
225 230 



<210> 218 
<211> 140 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 218 

Met Thr Ser Pro Leu Leu Glu Ser Arg Arg Gin Leu Arg Lys Cys Ala 
• 1 5 10 15 

Phe Gin Ala Leu Met Ser Leu Glu Phe Gly Thr Asp Val Glu Thr Ala 
20 25 30 

Cys' Arg Phe Ala Tyr Thr His Asp Arg Glu Tyr Thr Asp Val Gin Leu 
35 40 45 

Pro Ala Phe Leu He Asp Leu Val Ser Gly Val Gin Ala Lys Lys Glu 
50 55 60 

Glu Leu Asp Lys Gin He Thr Gin His Leu Lys Ala Gly Trp Thr He 
65 70 75 80 

Glu Arg Leu Thr Leu Val Glu Arg Asn Leu Leu Arg Leu Gly Val Phe 
85 90 • .95 

Glu He Thr Ser Phe Asp Thr Pro Gin Leu Val Ala Val Asn Glu Ala 
100 105 110 

He Glu Leu Ala Lys Asp Phe Ser Asp Gin Lys Ser Ma Arg Phe He 
115 120 125 

Asn Gly Leu Leu Ser Gin Phe Val Thr Glu Glu Gin 
130 135 140 



<210> 219 

<211> 1179 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 219 

Met Tyr Leu Lys Glu He Glu He Gin Gly Phe Lys Ser Phe Ala Asp 
1 5 10 15. 

Lys Thr Lys Val Val Phe Asp Gin Gly Val Thr Ala Val Val Gly Pro 
20 25 30 

Asn Gly Ser Gly Lys Ser Asn He Thr Glu Ser Leu Arg Trp Ala Leu 
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35 



40 



45 



Gly Glu Ser Ser Val Lys Ser Leu Arg Gly Gly Lys Met Pro Asp Val 
50 55 60 

5 

lie Phe Ala Giy Thr Glu Ser Arg Lys Pro Leu Asn Tyr Ala Ser Val 
65 70 75 80 

Val Val Thr Leu Asp Asn His Asp Gly Phe lie Lys Asp Ala Gly Gin 
10 85 90 95 

Glu lie Arg Val Glu Arg His lie Tyr Arg Ser Gly Asp Ser Glu Tyr 
100 105 110 

15 • Lys lie Asp Gly Lys Lys Val Arg Leu Arg Asp He His Asp Leu Phe 
115 120 125 



20 



Leu Asp Thr Gly Leu Gly Arg Asp Ser Phe Ser lie lie Ser Gin Gly 
130 135 140 

Lys Val Glu Glu He Phe Asn Ser Lys Pro Glu Glu Arg Arg Ala He 
145 150 155 160 



Phe Glu Glu Ala Ala Gly Val Leu Lys Tyr Lys Thr Arg Arg Lys Glu 

25 165 170 175 

Thr Glu Ser Lys Leu Gin Gin Thr Gin Asp Asn Leu Asp Arg Leu Glu 

180 185 190 

30 Asp He He Tyr Glu Leu Asp Asn Gin lie Lys Pro Leu Glu Lys Gin 

195 . . 200 205 



35 



Ala Glu Asn Ala Arg Lys Phe Leu Asp Leu Glu Gly Gin Arg Lys Ala 
■ 210 215 220 

He Tyr Leu Asp Val Leu Val Ala Gin He Lys Glu Asn Lys Ala Glu 
225 ' 230 235 240 



Leu Glu Ser Thr Glu Glu Glu Leu Ala Gin Val Gin Glu Leu Leu Met 
40 245 250 255 

Ser Tyr Tyr Gin Lys Arg Glu Lys Leu Glu Glu Glu Asn Gin Thr Leu 
260 265 270 

45 Lys Lys Gin Arg Gin Asp Leu Gin Ala Glu Met Ala Lys Asp Gin Gly 
275 280 285 



50 



Ser Leu Met Asp Leu Thr Ser Leu He Ser Asp Leu Glu Arg Lys Leu 

290 295 300 

Ala Leu Ser Lys Leu Glu Ser Glu Gin Val Ala Leu Asn Gin Gin Glu 

305 310 315 320 



Ala Gin Ala Arg Leu Ala Ala Leu Glu Asp Lys Arg Asn Ser Leu Ser 
55 325 330 335 



Lys Glu Lys Tyr Asp Lys Glu Ser Ser Leu Ala Leu Leu Glu Gly Asn 
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340 



345 



350 



Leu Val Gin Asn Asn Gin Lys Leu Asn Arg Leu Glu Ala Glu Leu Leu 
355 360 365 

Ala Phe Ser Asp Asp Pro Asp Gin Met He Glu Leu Leu Arg Glu Arg 
370 375 380 

Phe Val Ala Leu Leu Gin Glu Glu Ala Asp Val Ser Asn Gin Leu Thr 
385 390 395 400 

Arg He Glu Asn "Glu Leu Glu Asn Ser Arg Gin Leu Ser Gin Lys Gin 
405 410 415 

Ala Asp Gin Leu Glu Lys Leu Lys Glu Gin Leu Ala Thr Ala Lys Glu 
420 425 430 

Lys Ala Ser Gin Gin Lys Asp Glu Leu Glu Thr Ala Lys Val Gin Val 
435 440 445 

Gin Lys Leu Leu Ala Asp Tyr. Gin Ala He Ala Lys Glu Gin Glu Glu 
450 455 460 

Gin Lys Thr Ser Tyr Gin Ala Gin Gin Ser Gin Leu Phe Asp Arg Leu 
465 470 475 480 

Asp Ser Leu Lys Asn Lys Gin Ala Arg Ala Gin Ser Leu Glu Asn He 
485 490 495 

Leu Arg Asn His Ser Asn' Phe Tyr Ala Gly Val Lys Ser Val Leu Gin 
500 505 510 

Glu Lys Asp Arg Leu Gly Gly He He Gly Ala Val Ser Glu His Leu 
515 520 525 

Thr Phe Asp Val Tyr Tyr Gin Thr Ala Leu Glu He Ala Leu Gly Ala 
530 535 540 

Ser Ser Gin His He He Val Glu Asp Glu Glu Ser Ala Thr Lys Ala 
545 550 555 560 

He Asp Phe Leu Lys Arg Asn Arg Val Gly Arg Ala Thr Phe Leu Pro 
565 570 575 

Leu Thr Thr He Lys Ala Arg Thr He Ser Ser Gin Asn Gin Asp Ala 
580 585 - 590 

He Ala Val Ser Pro Gly Phe Leu Gly Met Ala Asp Glu Leu Val Thr 
595 600 605 

Phe Asp Thr Arg Leu Glu Ala He Phe Lys Asn Leu Leu Ala Thr Thr 
610 615 620 

Ala He Phe Asp Thr Val Glu His Ala Arg Glu Ala Ala Arg Gin Val 
625 630 635 640 



Arg Tyr Gin Val Arg Met Val. Thr Leu Asp Gly Thr Glu Leu Arg Thr 
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645 



650 



655 



Gly Gly Ser Tyr Ala Gly Gly Ala Asn Arg Gin Asn Asn Ser He Phe 
660 665 670 

5 

He Lys Pro Glu Leu Glu Gin Leu Gin Lys Glu He Ala Ala Asp Glu 
675 680 685 

Ala Ser Leu Gly Ser Glu Glu Ala Ala Leu Lys Thr Leu Gin Asp Gin 
10 690 695 700 

Met Ala Ala Leu Thr Glu Arg Leu Glu Ala He Lys Ser Gin Gly Glu 
705 ' 710 715 720 

15 Gin Ala Arg lie Gin Glu Gin Gly Leu Ser Leu Ala Tyr Gin Gin Thr 

725 730 735 



20 



Ser Gin Gin Val Glu Glu Leu Glu Thr Leu Trp Lys Leu Gin Glu Glu 
740 745 750 

Glu He Asp Arg Leu Ser Glu Gly Asp Trp Gin Ala Asp Lys Glu Lys 
755 760 765 



25 



Cys Gin Glu Ser Leu Ala Thr He Ala Ser Asp Lys Gin Asn Leu Glu 
770 775 780 



Ala Glu He Glu Glu He Lys Ser Asn Lys Asn Ala He Gin Glu Arg 

785 790 795 800 

30 Tyr Gin Asn Leu Gin Glu Glu Val Ala- Gin Ala Arg Leu Leu Lys Thr 

805 810 815 



35 



Lys Leu Gin Gly Gin Lys Arg Tyr Glu Val Ala Asp He Glu Arg Leu 

820 825 830 

Gly Lys Glu Leu Asp Asn Leu Asn He Glu Gin Glu Glu He Gin Arg 

835 840 845 



Met Leu Gin Glu Lys Val Asp Asn Leu Glu Lys Val Asp Thr Glu Leu 
40 850 855 860 

Leu Ser Gin Gin Ala Glu Glu Ser Lys Thr Gin Lys Thr Asn Leu Gin 
865 870 875 880 

45 Gin Gly Leu He Arg Lys Gin Phe Glu Leu Asp Asp He Glu Gly Gin 

88'5 890 895 



50 



Leu Asp Asp He Ala Ser His Leu Asp Gin Ala Arg Gin Gin Asn Glu 
900 • 905 910 

Glu Trp He Arg Lys Gin Thr Arg Ala Glu Ala Lys Lys Glu Lys Val 
915 920 925 



55 



Ser Glu Arg Leu Arg His Leu Gin Asn Gin Leu Thr Asp Gin Tyr Gin 
930 935 940 



He Ser* Tyr Thr Glu Ala Leu Glu' Lys Ala His Glu Leu Glu Asn Leu 
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945 950 955 960 

Asn Leu Ala Glu Gin Glu Val Gin Asp Leu Glu Lys Ala He Arg Ser 
965 970 975 

Leu Gly Pro Val Asn Leu Glu Ala He Asp Gin Tyr Glu Glu Val His 
980 985 990 

Asn Arg Leu Asp Phe Leu Asn Ser Gin Arg Asp Asp He Leu Ser Ala 
995 1000 1005 

Lys Asn Leu Leu Leu Glu Thr He Thr Glu Met Asn Asp Glu Val Lys 
1010 1015 1020 

Glu Arg Phe Lys Ser Thr Phe Glu Ala He Arg Glu Ser Phe Lys Val 
1025 1030 1035 1040 

Thr Phe Lys Gin Met Phe Gly Gly Gly Gin Ala Asp Leu He Leu Thr 
1045 1050 1055 

Glu Gly Asp Leu Leu Thr Ala Gly Val Glu He Ser Val Gin Pro Pro 
1060 1065 1070 

Gly Lys Lys He Gin Ser Leu Asn Leu Met Ser Gly Gly Glu Lys Ala 
1075 1080 1085 

Leu Ser Ala Leu Ala Leu Leu Phe Ser He He Arg Val Lys Thr He 
1090 1095 1100 

Pro Phe Val He Leu Asp Glu Val Glu Ala Ala Leu Asp Glu Ala Asn 
1105 1110 1115 1120 

Val Lys Arg Phe Gly Asp Tyr Leu Asn Arg Phe Asp Lys Asp Ser Gin 
1125 1130 . 1135 

Phe He Val Val Thr His Arg Lys Gly Thr Met Ala Ala Ala 'Asp Ser 
1140 1145 1150 

He Tyr Gly Val Thr Met Gin Glu Ser Gly Val Ser Lys He Val Ser 
1155 1160 1165 

Val Lys Leu Lys Asp Leu Glu Ser He Glu Gly 
1170 1175 



<210> 220 
<211> 447 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 220 

Met Thr Lys Arg Val Thr He lie Asp Val Lys Asp Tyr Val Gly Gin 
15 10 15 

Glu Val Thr He Gly Ala Trp Val Ala Asn Lys Ser Gly Lys Gly Lys 
20 25 30 
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lie Ala Phe Leu Gin Leu Arg Asp Gly Thr Ala Phe Phe Gin Gly Val 
35 40 45 

Ala Phe Lys Pro Asn Phe Val Glu Lys Phe Gly Glu Glu Val Gly Leu 
50 55 60 

Glu Lys Phe Asp Val He Lys Arg Leu Ser.Gln Glu Thr Ser Val Tyr 
65 70 75 80 

Val Thr Gly He Val Lys Glu Asp Glu Arg Ser Lys Phe Gly Tyr Glu 
85 90 95 

Leu Asp He Thr Asp He Glu Val He Gly Glu Ser Gin Asp Tyr Pro 
100 105 110 

He Thr Pro Lys Glu His Gly Thr Asp Phe Leu Met Asp Asn Arg His 
115 120 125 

Leu Trp Leu Arg Ser Arg Lys Gin Val Ala Val Leu Gin He Arg Asn 
130 135 140 

Ala He He Tyr Ala Thr Tyr Glu Phe Phe Asp Lys Asn Gly Phe Met 
145 150 155 160 

Lys Phe Asp Ser Pro He* Leu Ser Gly Asn Ala Ala Glu Asp Ser Thr 
165 170 175 

Glu Leu Phe Glu Thr Asp Tyr Phe Gly Thr Pro Ala Tyr Leu Ser Gin 
180 185 190 

Ser Gly Gin Leu Tyr Leu Glu Ala Gly Ala Met Ala Leu Gly Arg Val 
195 200 205 

Phe Asp Phe Gly Pro Val Phe Arg Ala Glu Lys Ser Lys Thr Arg Arg 
210 215 220 

His Leu Thr' Glu Phe Trp Met Met Asp Ala Glu Tyr Ser Tyr Leu Thr 
225 230 235 240 

His Asp Glu Ser Leu Asp Leu Gin Glu Ala Tyr Val Lys Ala Leu Leu 
245 250 255 

Gin Gly Val Leu Asp Arg Ala Pro Gin Ala Leu Glu Thr Leu Glu Arg 
260 265 270 

Asp Thr Glu Leu Leu Lys Arg Tyr He Ala Glu Pro Phe Lys. Arg lie 
275 280 ... , 285 

Thr Tyr Asp Gin Ala He Asp Leu Leu Gin Glu His Glu Asn Asp Glu 
290 295 300 

Asp Ala Asp Tyr Glu His Leu Glu His Gly Asp Asp Phe Gly Ser Pro 
305 310 315 320 



His Glu Thr Trp He Ser Asn His Phe Gly Val Pro Thr Phe Val Met 
325 330 335 
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Asn Tyr Pro Ala Ala lie Lys Ala Phe Tyr Met Lys Pro Val Pro Gly 
■340 345 350 

Asn Pro Glu Arg Val Leu Cys Ala Asp Leu Leu Ala Pro Glu Gly Tyr 
5 355 360 365 

Gly Glu lie lie Gly Gly Ser Met Arg Glu Glu Asp Tyr Asp Ala Leu 
370 375 380 

10 Val Ala Lys Met Asp Glu Leu Gly Met Asp Arg Thr Glu Tyr Glu Phe 
385 390 .395 400 

Tyr Leu Asp Leu Arg Lys Tyr Gly Thr Val Pro His Gly Gly Phe Gly 
405 410 415 

15 

lie Gly He Glu Arg Met Val Thr Phe Ala Ala Gly Thr Lys His He 
420 425 430 

Arg Glu Ala He Pro Phe Pro Arg Met Leu His Arg He Lys Pro 
20 435 440 445 



<210> 221 
<211> 308 
25 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 221 

Met Ser Glu Lys . Leu Val Glu He Lys Asp Leu Glu He Ser Phe Gly 
30 1 5 10 15 

Glu Gly Ser Lys Lys Phe Val. Ala Val Lys Asn Ala Asn Phe Phe He 
20 25 30 

35 Asn Lys Gly Glu Thr Phe Ser Leu Val Gly Glu Ser Gly Ser Gly Lys 
35 40 45 

Thr Thr He Gly Arg Ala He He Gly Leu Asn Asp Thr Ser Asn Gly 
50 55 60 

40 

Asp He He Phe Asp Gly Gin Lys He Asn Gly Lys Lys Ser Arg Glu 
65 70 75 80 

Gin Ala Ala Glu Leu He Arg Arg He Gin Met lie Phe Gin Asp Pro 
45 .85 90 95 

Ala Ala Ser Leu Asn Glu Arg Ala Thr Val Asp Tyr lie lie Ser Glu 
100 105 ' ' 110 

50 Gly Leu Tyr Asn His Arg Leu Phe Lys Asp Glu Glu Glu Arg Lys Glu 
115 120 125 

Lys Val Gin Asn He lie Arg Glu Val Gly Leu Leu Ala Glu His Leu 
130 135 140 

55 

Thr Arg Tyr Pro His Glu Phe Ser Gly Gly Gin Arg Gin Arg He Gly 
145 150 155 160 
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lie Ala Arg Ala Leu Val Met Gin Pro Asp Phe Val lie Ala Asp Glu 
165 170 175 

Pro lie Ser Ala Leu Asp Val Ser Val Arg Ala Gin Val Leu Asn Leu 
180 185 190 

Leu Lys Lys Phe Gin Lys Glu Leu Gly Leu Thr Tyr Leu Phe lie Ala 
195 200 205 

His Asp Leu Ser Val Val Arg Phe lie Ser Asp Arg He Ala Val lie 
210 215 220 

Tyr Lys Gly Val He Val Glu Val Ala Glu Thr Glu Glu Leu Phe Asn 
225 230 235 240 

Asn Pro He His Pro Tyr Thr Gin Ala Leu Leu Ser Ala Val Pro He 
245 250 . 255 

Pro Asp Pro He Leu Glu Arg Lys Lys Val Leu Lys Val Tyr Asp Pro 
260 265 270 

Ser Gin His Asp Tyr Glu Thr Asp Lys Pro Ser Met Val Glu He Arg 
275 280 285 

Pro Gly His Tyr Val Trp Ala Asn Gin Thr Glu Leu Ala Arg Tyr Gin 
290 295 300 

Lys Gly Leu Asn 
305 



<210> 222 
<211> 424 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 222 

Met Lys He Ser Trp Asn Gly Phe Ser Lys Lys Ser Tyr Gin Glu Arg 
1 5 10 15 

Leu Glu Leu Leu Lys Ala Gin Ala Leu Leu Ser Pro Glu Arg Gin Ala 
20 25 30 

Ser Leu Glu Lys Asp Glu Gin Met Ser Val Thr Val Ala Asp. Gin Leu 
35 40 . 45 

Ser Glu Asn Val Val -Gly Thr Phe Ser'Leu Pro Tyr Ser Leu Val Pro 
50 55 60 

Glu Val Leu Val Asn Gly Gin Glu Tyr Thr Val Pro Tyr Val Thr Glu 
. 65 70 75 80 

Glu Pro Ser Val Val Ala Ala Ala Ser Tyr Ala Ser Lys He He Lys 
85 90 95 

Arg Ala Gly Gly Phe Thr Ala Gin Val His Gin Arg Gin Met He Gly 
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100 105 110 



Gin Val Ala Leu Tyr Gin lie Ala Asn Pro Lys Leu Ala Gin Glu Lys 
115 120 125 

lie Ala Ser Lys Lys Ala Glu Leu Leu Glu Leu Ala Asn Gin Ala Tyr 
130 135 140 

Pro Ser lie Val Lys Arg Gly Gly Gly Ala Arg Asp Leu His Val Glu 
145 150 155 160 



Gin He Lys Gly Glu Pro Asp Phe 
165 

Thr Gin Glu Ala Met Gly Ala Asn 
180 

Leu Lys Pro Val Leu Glu Glu Leu 
195 200 

He Leu Ser Asn Tyr Ala Thr Asp 
210 215 



Leu Val Val Tyr He His Val Asp 
170 175 

Met Leu Asn Thr Met Leu Glu Ala 
185 190 

Ser Gin Gly Gin Ser* Leu Met Gly 
205 

Ser Leu Val Thr Ala Ser Cys Arg 
220 



He Ala Phe Arg Tyr Leu Ser Arg 
225 230 

Ala Glu Lys He Ala Leu Ala Ser 
245 

Arg Ala Ala Thr His Asn Lys Gly 
260 

Leu He Ala Thr Gly Asn Asp Trp 
275 280 

Ala Phe Ala Ser Arg Asp Gly Arg 
290 295 

Leu Asp Leu Glu Arg Glu Glu Leu 
305 310 



Gin Lys Asp Gin Gly Arg Glu He 
235 240 

Gin Phe Ala Gin Ala Asp Pro Tyr 
250 255 

He Phe Asn Gly He Asp Ala He 
265 270 

Arg Ala He Glu Ala Gly Ala His 
285 

Tyr Gin Gly Leu Ser Cys Trp Thr 
300 

Val Gly Glu Met Thr Leu Pro Met 
315 320 



Pro Val Ala Thr Lys Gly Gly Ser 
325 

Leu Ser His Asp Leu Leu Gly Asn 
340 

He He Val Ser He Gly Leu Ala 
355 360 

Leu Val Ser Thr Gly He Gin Gin 
370 375 

Ser Leu Ala Leu Leu Ala Gly Ma 
385 390 

Val Glu Airg Leu He Ser Asp Lys 



He Gly Leu Asn Pro Arg Val Ala 
330 335 ' 

Pro Ser Ala Arg Glu Leu Ala Gin 
345 350 

Gin Asn Phe Ala Ala Leu Lys Ala 
365 

Gly His Met Lys Leu Gin Ala Lys 
380 

Ser Glu Ser Glu Val Ala Pro Leu 

395 400 

Thr Phe Asn Leu Glu Thr Ala Gin 
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405 



410 



415 



Arg Tyr Leu Glu Asn Leu Arg Ser 
420 . 



<210> 223 
<211> 262 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 223 

Met Pro lie Thr Ser Leu Glu He Lys Asp Lys Thr Phe Gly Thr Arg 
1 5 10 15 

Phe Arg Gly Phe Asp Pro Glu Glu Val Asp Glu Phe Leu Asp He Val 
20 25 30 

Val Arg Asp Tyr Glu Asp Leu Val Arg Ala Asn His Asp Lys Asn Leu 
35 40 45 

Arg He Lys Ser Leu Glu Glu Arg Leu Ser Tyr Phe Asp Glu He Lys 
50 55 60 

Asp Ser Leu Ser Gin Ser Val Leu He Ala Gin Asp Thr Ala Glu Arg 
65 70 75 80 

Val Lys Gin Ala Ala His Glu Arg Ser Asn Asn He He His Gin Ala 
85 90 95 

Glu Gin Asp Ala Gin Arg Leu Leu Glu Glu Ala Lys Tyr Lys Ala Asn 
100 105 110 

Glu He Leu Arg Gin Ala Thr Asp Asn Ala Lys Lys Val Ala Val Glu 
115 120 125 

Thr Glu Glu Leu Lys Asn Lys Ser Arg Val Phe His Gin Arg Leu Lys 
130 135 140 

Ser Thr He Glu Ser Gin Leu Ala He Val Glu Ser Ser Asp Trp Glu 
145 150 155 ■ 160 

Asp He Leu Arg Pro Thr Ala Thr Tyr Leu Gin Thr Ser Asp Glu Ala 
165 170 17 5 

Phe Lys Glu Val Val Ser Glu Val Leu Gly Glu Pro He Pro Ala Pro 
180 185 190 

He Glu Glu Glu Pro He Asp Met Thr Arg Gin Phe Ser Gin Ala Glu 
195 200 205 

Met Ala Glu Leu Gin Ala Arg He Glu Val Ala Asp Lys Glu Leu Ser 
210 215 220 

Glu Phe Glu Ala Gin He Lys Gin Glu Val Glu Ala Pro Thr Pro Val 
225 230 235 240 
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Val Ser Pro Gin Val Glu Glu Glu Pro Leu Leu lie Gin Leu Ala Gin ' 
245 250 255 

Cys Met Lys Asn Gin Lys 
5 260 



<210> 224 
<211> 575 
10 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 224 

Met Ser Asn Gly Gin Leu He Tyr Leu Met Val Ala He Ala Val He 
.15 1 5 10 15 

Leu Val Leu Ala Tyr Val Val Ala He Phe Leu Arg Lys Arg Asn Glu 
20 25 30 

20 Gly Arg Leu Glu Ala Leu Glu Glu Arg Lys' Glu Glu Leu Tyr Asn Leu 
35 40 45 



25 



Pro Val Asn Asp Glu Val Glu Ala Val Lys Asn Met His Leu He Gly 
50 55 60 

Gin Ser Gin Val Ala Phe Arg Glu Trp Asn Gin Lys Trp Val Asp Leu 
65 70 75 80 



Ser Leu Asn Ser Phe Ala Asp He Glu Asn Asn Leu Phe Glu Ala Glu 
30 85 90 95 

Gly Tyr Asn His Ser Phe Arg Phe Leu Lys Ala Ser His Gin He Asp 
100 105 • 110 

35 Gin He Glu Ser Gin He Thr Leu He Glu Glu Asp He Ala Ala He 
115 120 125 

Arg Asn Ala Leu Ala Asp Leu Glu Lys Gin Glu Ser Lys Asn Ser Gly 
130 135 140 

Arg Val, Leu His Ala Leu Asp Leu Phe Glu Glu Leu Gin His Arg Val 
145 150 155 160 

Ala Glu Asn Ser Glu Gin Tyr Gly Gin Ala Leu Asp Glu He Glu Lys 
165 170 175 

Gin Leu Glu Asn He Gin Ser Glu Phe Ser Gin Phe Val Thr Leu Asn 
180 185 " 190 

50 Ser Ser Gly Asp Pro Val Glu Ala Ala Val He Leu Asp Asn Thr Glu 
195 200 205 



55 



Asn His He Leu Ala Leu Ser His He Val Asp Arg Val Pro Ala Leu 

210 215 220 

Val Thr Thr Leu. Ser Thr Glu Levi Pro Asp Gin Leu Gin Asp Leu Glu 

225 230 235 240 
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Ala Gly Tyr Arg Lys Leu lie Asp Ala Asn Tyr His Phe Val Glu Thr 
245 250 255 

5 Asp lie Glu Ala Arg Phe His Leu Leu Tyr Glu Ala Phe Lys Lys Asn 
260 . 265 270 

Gin Glu Asn lie Arg Gin Leu Glu Leu Asp Asn Ala Glu Tyr Glu Asn 
275 280 285 



10 



Gly Gin Ala Gin Glu Glu lie Asn Ala Leu Tyr Asp lie Phe Thr Arg 
290 295 300 



Glu lie Ala Ala Gin Lys Val Val Glu Asn Leu Leu Ala Thr Leu Pro 
15 305 310 315 320 

Thr Tyr Leu Gin His Met Lys Glu Asn. Asn Thr Leu Leu Gly Glu Asp 
325 - 330 335 

20 lie Ala Arg Leu Asn Lys Thr Tyr Leu Leu Pro Glu Thr Ala Ala Ser 
340 345 350 



25 



His Val Arg Arg lie Gin Thr Glu Leu Glu Ser Phe Glu Ala Ala lie 
355 360 365 

Val Glu Val Thr Ser Asn Gin Glu Glu Pro Thr Gin Ala Tyr Ser Val 
370 375 380 



Leu Glu Glu Asn Leu Glu Asp Leu Gin Thr Gin Leu Lys Asp lie. Glu 

30 385 390 395 400 

Asp Glu Gin lie Ser Val Ser Glu Arg Leu Thr Gin lie Glu Lys Asp 
405 410 415 

35 Asp lie Asn Ala Arg Gin Lys Ala Asn Val Tyr Val Asn Arg Leu His 
420 425 430 



Thr lie Lys Arg Tyr Met Glu 
435 

40 

Thr Phe Leu Lys Leu Phe Phe 
450 455 

Met Val Glu Leu Glu Gin Lys 
45 465 470 

Val Leu Glu lie Ala Thr Asn 
485 

50 Tyr Asn He Val Glri Tyr Ala 
500 

Ser Asn Arg Tyr Arg Ser Phe 
515 

55 

Glu Ala Leu Asp He Phe Glu 
530 535 



Lys Arg Asn Leu Pro Gly He Pro Gin 
440 445 

Thr Ala Ser Asn Asn Thr Glu Asp Leu 
460 

Met He Asn He Glu Ser Val Thr Arg 
475. 480 

Asp Met_Glu Ala Leu Glu Thr Glu Thr 
490 495 

Thr Leu Thr Glu Gin Leu Leu Gin Tyr 
505 ' 510 

Asp Glu. Arg He Gin Glu Ala Phe Asn 
520 525 

Lys Glu Phe Asp Tyr His Ala Ser Phe 
540 
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Asp Lys He Ser Gin Ala Leu Glu Val Ala Glu Pro Gly Val Thr Asn 

545 550 555 560 

5 Arg Phe Val Thr Ser Tyr Glu Lys Thr Arg Glu Thr He Arg Phe 

565 570 575 



<210> 225 
10 <211> 800 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 225 

15 Met Leu He Ser Tyr Lys Trp Leu Lys Glu Leu Val Asp He Asp Val 
1 ■ 5 10 15 

Pro Ser Gin Glu Leu Ala Glu Lys Met Ser Thr Thr Gly He Glu Val 
20 25 30 

20 

Glu Gly Val Glu Ser Pro Ala Ala Gly Leu Ser Lys He Val Val Gly 
35 40 45 

Glu Val Leu Ser Cys Glu Asp Val Pro Glu Thr His Leu His Val Cys 
25 50 55 60 

Gin Val Asn Val Gly Glu Glu Glu Arg Gin He Val Cys Gly Ala Pro 
65 70 75 80 

30 Asn Val Arg Ala Gly He Lys Val Met Val Ala Leu Pro Gly Ala Arg 

85 90 95 



35 



He Ala Asp Asn Tyr Lys lie Lys Lys Gly Lys He Arg Gly Leu Glu 
100 105 110 

Ser Leu Gly Met He Cys Ser Leu Gly Glu Leu Gly He Ser Asp Ser 
115 120 125 



Val Val Pro Lys Glu Phe Ala Asp Gly He Gin He Leu Pro Glu Asp 
40 130 135 140 

Ala Val Pro Gly Glu Glu Val Phe Ser Tyr Leu Asp Leu Asp Asp Glu 

145 150 155 160 

45 lie lie Glu Leu Ser lie Thr Pro Asn Arg Ala Asp Ala Leu Ser Met 

165 170 175 



50 



Cys Gly Val Ala His Glu Val Ala Ala He Tyr Asp Lys Ala Val Asn 
180 185 190 

Phe Lys Glu Phe Thr Leu Thr Glu Thr Asn Glu Ala Ala Ala Asp Ala 
195 200 205 



Leu Ser Val Ser He Glu Thr Asp Lys Ala Pro Tyr Tyr Ala Ala Arg 
55 210 215 220 

lie Leu Asp Asn Val Thr lie Ala Pro Ser Pro Gin Trp Leu Gin Asn 
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225 



230 



235 



240 



Leu Leu Met Asn Glu Gly lie Arg Pro lie Asn Asn Val Val Asp Val 
245 250 255 

5 

Thr Asn Tyr lie Leu Leu Tyr Phe Gly Gin Pro Met His Ala Phe Asp 

260 ■ 265 270 

Leu Asp Asn Phe Glu Gly Thr Asp He Arg Val Arg Glu Ala Arg .Ala 
10 275 280 285 



15 



Gly Glu Lys Leu Val Thr Leu Asp Gly Glu Glu Arg Asp Leu Asp Val 
290 295 300 

Asn Asp Leu Val He Thr Val Ala Asp Lys Pro Val Ala Leu Ala Gly 
305 310 315 320 



20 



Val Met Gly Gly Gin Ala Thr Glu He Ser Glu Lys Ser Ser Arg Val 
325 330 335 

Val Leu Glu Ala Ala Val Phe Asn Gly Lys Ser He Arg Lys Thr Ser 
340 345 350 



Gly Arg Leu Asn Leu Arg Ser Glu Ser Ser Ser Arg Phe Glu Lys Gly 
25 355 360 365 

He Asn Val Ala Thr Val Asn Glu Ala Leu Asp Ala Ala Ala Ser Leu 
370 375 380 

30 He Ala Glu Leu Ala Gly Ala Thr Val Arg Lys Gly He Val Ser Ala' 
385 390 . 395 400 



35 



Gly Glu Leu Asp Thr Ser Asp Val Glu Val Ser Ser Thr Leu Ala Asp 
405 410 415 

Val Asn Arg Val Leu Gly Thr Glu Leu Ser Tyr Ala Asp Val Glu Asp 

420 425 430 



Val Phe Arg Arg Leu Gly Phe Gly Leu Ser Gly Asn Ala Asp Ser Phe 
40 435 440 445 



45 



Thr Val Arg Val Pro Arg Arg Arg Trp Asp He Thr lie Glu Ala Asp 
450 455 460 

Leu Phe Glu Glu lie Ala Arg He Tyr Gly Tyr Asp Arg Leu Pro Thr 

465 " 470 475 480 



50 



Ser Leu Pro Lys Asp Asp Gly Thr Ala Gly Glu Leu Thr Ala Thr Gin 
485 490 495 

Lys Leu Arg Arg Gin Val Arg Thr He Ala Glu Gly Ala Gly Leu Thr 
500 505 510 



Glu He He Thr Tyr Thr Leu Thr Thr Pro Glu Lys Ala Val Glu Phe 
55 515 520 525 



Thr Ala Gin Pro Ser Asn Leu Thr Glu Leu Met Trp Pro Met Thr Val 
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530 



535 



540 



Asp Arg Ser Val Leu Arg Gin Asn Met lie Ser Gly lie Leu Asp Thr 
545 550 555 560 

5 

Val Ala Tyr Asn Val Ala Arg Lys Asn Lys Asn Leu Ala Leu Tyr Glu 
565 570 575 

lie Gly Lys Val Phe Glu Gin Thr Gly Asn Pro Lys Glu Glu Leu Pro 
10 580 585 590 

Asn Glu He Asn Ser Phe Ala Phe Ala Leu Thr Gly Leu Val Ala Glu 
595 600 605 

15 Lys Asp Phe Gin Thr Ala Ala Val Pro Val Asp Phe Phe Tyr Ala Lys 
610 615 620 



20 



Gly He Leu Glu Ala Leu Phe Thr Arg Leu Gly Leu Gin Val Thr Tyr 
625 630 635 640 

Thr Ala Thr Ser Glu He Ala Ser Leu His Pro Gly Arg Thr Ala Val 
645 650 655 



He Ser Leu Gly Asp Gin Val Leu Gly Phe Leu Gly Gin Val His Pro 
25 660 665 670 

Val Thr Ala Lys Ala Tyr Asp He Pro Glu Thr Tyr Val Ala Glu Leu 
675 680 685 

30 Asn Leu Ser Ala He Glu Ala Ala Leu Gin Pro Ala Thr Pro Phe Val 
690 695 700 



35 



Glu He Thr Lys Phe Pro Ala Val Ser Arg Asp Val Ala Leu Leu Leu 

705 710 715 720 

Lys Ala Glu Val Thr His Gin Glu Val Val Asp Ala He Gin Ala Ala 

725 730 735 



Gly Val Lys Arg Leu Thr Asp He Lys Leu Phe Asp Val Phe Ser Gly 
40 740 745 750 

Glu Lys Leu Gly Leu Gly Met Lys Ser Met Ala Tyr Ser Leu Thr Phe 
755 760 765 

45 Gin Asn Pro Glu Asp Ser Leu Thr Asp Glu Glu Val Ala Arg Tyr Met 
770 775 780 



50 



Glu Lys He Gin Ala Ser Leu Glu Glu Lys Val Asn Ala Glu Val Arg 
785 790 795 800 



55 <210> 226 
<211> 180 
<212> PRT 
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<213> Streptococcus pneumoniae 
<400> 226 

Met Leu Glu Asn Asp lie Lys Lys 
1 5 

Thr Glu Ala Ala Lys Lys Leu Gly 
20 

Gly Lys Asn Pro He Leu Val Gly 

35 40 

Met Ala Glu Leu Val Lys His He 
50 55 

Met Met Val Ser Ser Tyr His Gly 
65 70 

Asn He Lys Gin Asp Val Thr Gin 
85 



Val Leu Val Ser His Asp Glu He 
10 15 

Ala Gin Leu Thr Lys Asp Tyr Ala 
25 30 

lie Leu Lys Gly Ser He Pro Phe 
45 

Asp Thr His lie Glu Met Asp" Phe 
60 

Gly Thr Ala Ser Ser Gly Val lie 
75 80 

Asp lie Lys Gly Arg His Val Leu 
90 95 



. Phe Val Glu Asp He lie Asp Thr 
100 

Asp Met Phe Lys Ala Arg Glu Ala 
115 120 

Leu Asp Lys Pro Glu Gly Arg Val 
130 135 

Cys Phe Thr lie Pro Asn Glu Phe 
145 * ' 150 

Lys Glu Asn Tyr Arg Asn Leu Pro 
165 

Val Tyr Ser Asn 
180 



Gly Gin Thr Leu Lys Asn Leu Arg 
105 110 

Ala Ser Val Lys He. Ala Thr Leu 
125 

Val Glu lie Glu Ala Asp Tyr Thr 
140 

Val Val Gly Tyr Gly Leu Asp Tyr 
155 160 

Tyr He Gly Val Leu Lys Glu Glu 
170 175 
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